UNIVERSAL PRIMERS FOR WILDLIFE IDENTIFICATION
TECHNICAL FIELD

The invention relates to novel universal primers that can amplify a fragment of the
cytochrome b gene of any animal species in the polymerase chain reaction (PCR) and
reveal the identity of biological material of unknown animal origin at the species and
sub-species levels. The invention also provides a method for the identification of
fragments of the mitochondrial cytochrome b gene in biological material of unknown origin.

BACKGROUND ART 

A large number of studies in evolutionary biology utilize phylogenetic information
obtained from the mitochondrial cytochrome b gene. It has been identified as a potent
molecule for distinguishing the phylogenetic depth of different lineages to family, genus
and species in molecular taxonomy [1-66]. A vast database of cytochrome b gene sequences
of different animal species has accumulated in public databases such as GenBank and NCBI
(http://www.ncbi.nlm.nih.gov). We have utilized this capacity of the cytochrome b gene to
establish the identity of the origin of animal parts and products to their family, genus
and species sources. The technique developed is based on a pair of universal primers that
can amplify a small fragment of the cytochrome b gene from a vast range of animal species.

Establishing the identity of confiscated animal parts and products is a great challenge to
law enforcement agencies because none of the methods available to date is efficient enough
to reveal the identity of animal remains beyond a reasonable doubt. Morphological markers
described for certain species allow the identification of complete animal specimens [67].
However, a complete specimen is very rarely confiscated by the investigation agencies;
therefore, these markers are not practical in wildlife forensics. Biochemical traits such
as bile characteristics [68] and blood haem analysis [69,70] have also been employed in
wildlife forensics for the identification of individual species. The difficulty with these
markers is that they are limited in number and are rarely found in the natural forms in
which they were originally described as characteristic of a particular species.

Molecular approaches such as microsatellite-based identification [71], restriction
fragment length polymorphism analysis of mitochondrial genes, or PCR-based
species-specific STS markers require prior information about the species to establish
identity [72,73]. These methods also need a significant amount of DNA to be analysed. In
forensics, we may not have prior information about the species origin of confiscated
animal parts and products; therefore, these methods are not really useful or practical in
wildlife identification. The technique invented by us is universal and therefore does not
require any background information to establish the identity of unknown confiscated
remains at the family, genus and species levels. Being a PCR-based procedure, it can be
applied to trace amounts of any biological material. Because the amplicon is short
(472 bp), it can work well with the mutilated remains that are commonly seized by crime
investigation agencies. It does not require a large amount of genetic material (i.e. DNA)
to establish identity and hence can detect minute amounts of adulteration in food
products. The procedure described is simple and very fast. Owing to these advantages, the
procedure invented by us is most suited for forensic wildlife identification.

OBJECTS OF THE INVENTION 

The main object of the invention is to identify a fragment of the mitochondrial cytochrome
b gene capable of significantly discriminating among various evolutionary lineages of
different animal species.

Another object is to identify a fragment of the mitochondrial cytochrome b gene which is
flanked by sequences that are highly conserved across a vast range of animal species.

Yet another object is to detect a fragment of the mitochondrial cytochrome b gene which is
polymorphic inter-specifically, but monomorphic within a species.

Still another object is to develop universal primers to amplify the fragment of the
mitochondrial cytochrome b gene using the polymerase chain reaction.

Another object is to develop a PCR protocol that works universally with DNA template of
any unknown origin (i.e. all the animal species).

Yet another object is to provide a universal method for identification of the species of
analyzed material (i.e. the DNA isolated from confiscated animal remains of unknown
origin) using public databases such as GenBank, NCBI etc.

Still another object is to provide a universal method of animal identification to link
the crime to the criminal beyond a reasonable doubt.

Another object is to provide a universal method to establish the identity of biological
materials such as skin, horns etc. confiscated from animal poachers, and to determine
whether they are those of an endangered species.

Yet another object is to provide a universal method for establishing the identity of
confiscated animal parts and products of endangered animal species for the purpose of
producing molecular evidence of animal hunting and related crime in a court of law, so
that human violation of wildlife resources can be controlled.

Still another object is to provide a universal technique to obtain an idea of the
geographical location where a wildlife crime was committed, based on the haplotype of the
poached animal identified by the universal primers invented.

Another object is to provide a universal technique of animal identification to detect the
adulteration of vegetarian food products with animal meat/products for the purpose of food
fortification, by the food fortification agencies.

Yet another object is to provide a universal technique for detecting the origin of blood
or blood stains etc. collected from the scene of crimes such as murder, rape etc., in
order to establish the origin of blood found at the scene of crime when it appears that
criminals have deliberately spread the blood of an animal at the scene of crime to
confuse the crime investigation agencies and forensic scientists with human blood.

Another object is to invent and authenticate a universal technique that can be converted
to (a) 'MOLECULAR KIT' and (b) 'DNA CHIPS' based applications to meet the requirements of
the above objectives.

SUMMARY OF THE INVENTION

Accordingly, the invention provides novel universal primers that can amplify a fragment
of the cytochrome b gene of any animal species in the polymerase chain reaction (PCR) and
reveal the identity of biological material of any unknown animal origin.

DETAILED DESCRIPTION OF THE INVENTION

Keeping in view the above objectives, the cytochrome b gene sequences (1140 bp) of 221
distantly related animal species (listed in Table 1) representing various families were
obtained from the public database NCBI (http://www.ncbi.nlm.nih.gov). These sequences
were aligned using the software CLUSTAL X (1.8) (NCBI, USA), and a fragment of the gene
(472 bp, alignment shown in Table 2) was identified which had all the features mentioned
above under columns 1, 2 and 3 of sub-heading 'Objectives of invention'. As for the
identity of this fragment, it includes the nucleotides between 398 and 869 in Antilope
cervicapra and Felis catus, and between 399 and 870 in Homo sapiens sapiens. Except at a
few positions (marked with a star (*) in Table 2), the nucleotide sequences of this
fragment are highly variable amongst animal species, giving rise to their unique
molecular signatures. These molecular signatures are characteristic of each species and
form the basis for revealing the identity of biological material of unknown animal origin
by the procedure invented by us. Considering Antilope cervicapra as a representative
species, the sequence of this fragment is mentioned herewith:
Mitochondrial cytochrome b gene sequence (398-869 bp) of Antilope cervicapra:
"taccatgaggacaaatatctttttgaggagcaacagtcatcaccaatctcctttcagcaatcccatacatcggtacaaacctagtaga
atgaatctgaggagggttctcagtagataaagcaacccttacccgatttttcgccttccactttatcctcccatttatcattgcagccctt
accatagtacacctactgtttctccacgaaacaggatccaacaaccccacaggaatctcatcagacgcagacaaaattccattccac
ccctactacactatcaaagatatcctaggagctctactattaattttaaccctcatgcttctagtcctattctcaccggacctgcttggag
acccagacaactatacaccagcaaacccacttaatacacccccacatatcaagcccgaatgatacttcctatttgcatacgcaatcct
ccgatcaattcctaacaaactaggagg".
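
The coordinates 398-869 are read here as inclusive, 1-based positions, so the fragment spans 869 - 398 + 1 = 472 nucleotides. The following minimal Python sketch illustrates that arithmetic; the helper name `extract_fragment` and the toy 1140-nt placeholder sequence are assumptions for illustration and are not part of the original disclosure.

```python
def extract_fragment(gene: str, start: int, end: int) -> str:
    """Return the inclusive, 1-based interval [start, end] of a sequence."""
    if not 1 <= start <= end <= len(gene):
        raise ValueError("coordinates fall outside the sequence")
    # Python slices are 0-based and end-exclusive, hence start - 1.
    return gene[start - 1:end]


# Toy 1140-nt placeholder standing in for a full-length cytochrome b gene
# (hypothetical data, not a real sequence).
toy_gene = "acgt" * 285
fragment = extract_fragment(toy_gene, 398, 869)
print(len(fragment))  # 869 - 398 + 1 = 472
```

The same helper with coordinates 399 and 870 also yields 472 positions, which is why the fragment has the same length in species whose numbering is shifted by one.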

A pair of universal primers was designed to amplify this fragment in the polymerase chain
reaction (PCR). These primers were named 'mcb398' and 'mcb869' because they amplify the
region of the mitochondrial cytochrome b gene between nucleotides 398 and 869 of Antilope
cervicapra, the representative animal species for this invention. We took this animal as
the representative species because the idea of developing such novel primers came to the
inventors while they were working on the genome of this animal at the Centre for Cellular
and Molecular Biology, Hyderabad, India. These primers work universally because their 3'
ends are highly conserved amongst a vast range of animal species (shown in Table 2). As
mentioned above, the DNA fragment targeted by these primers (the sequence of which is
shown above) is highly polymorphic inter-specifically; however, it is monomorphic among
individuals of the same species (Tables 6, 7a, 7b, 7c, 7d and 8, respectively). These
unique features of the targeted region enable the primers to generate the molecular
signature of an individual species, thereby enabling them to differentiate amongst animals
of different species (see Figure 1c). The variation within the fragment amplified by these
primers increases with increasing evolutionary distance between the lineages of two
animals (Table 8). These unique features of the fragment amplified by the universal
primers 'mcb398' and 'mcb869' invented by the applicants fulfill the objectives of the
invention.
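
The distinction between conserved and variable positions can be illustrated with a short Python sketch that marks fully conserved alignment columns with a star, in the manner described for Table 2. The function name `conservation_line` and the three toy sequences are hypothetical; this is an illustration of the idea, not the CLUSTAL X implementation.

```python
def conservation_line(aligned: list[str]) -> str:
    """Mark columns where every sequence carries the same nucleotide with '*'."""
    if len({len(seq) for seq in aligned}) != 1:
        raise ValueError("aligned sequences must be non-empty and of equal length")
    marks = []
    for column in zip(*aligned):
        # A column is conserved when it holds one distinct symbol that is not a gap.
        conserved = len(set(column)) == 1 and column[0] != "-"
        marks.append("*" if conserved else " ")
    return "".join(marks)


# Three hypothetical aligned sequences (toy data).
toy_alignment = [
    "taccatgagg",
    "taccacgagg",
    "taccatgaag",
]
for seq in toy_alignment:
    print(seq)
print(conservation_line(toy_alignment))
```

Unstarred columns are the variable sites that together make up a species signature.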

Thus, the primers invented by us can generate the molecular signature from any biological
material of unknown animal origin, which is characteristic of its family, genus and, more
precisely, its species. When these signatures are compared in-silico with the signatures
already available in public databases (viz. GenBank, NCBI database etc.) using the BLAST
software [73], the comparison indicates the family, genus or species of the analyzed
material. This is in turn confirmed practically by comparison with reference animals of
the revealed family, genus or species, which are included in further analysis with the
primers 'mcb398' and 'mcb869'. The complete procedure involved in the analyses (the word
'analyses' should be understood as the stepwise procedure to establish the identity of
biological remains of any unknown animal origin for the aims mentioned in columns 1-13
under sub-heading 'Objectives of invention') is briefed under Examples 5 and 6,
respectively, and illustrated in Figures 1a, 1b and 1c, respectively.

BRIEF DESCRIPTION OF DRAWING AND TABLES

Figure 1a. Illustration of the step-wise procedure involved in the analyses. The unknown
biological material, i.e. 'adil.flesh', refers to the confiscated skin mentioned in
'Example 6'. The arrow marks indicate the stepwise procedure involved. The brief
description of Figure 1a is as follows:

The biological material, i.e. the confiscated skin 'adil.flesh', was subjected to DNA
isolation using standard procedures [74]. The DNA obtained was amplified using the
primers 'mcb398' and 'mcb869' in PCR, fractionated in a 2% (w/v) agarose gel, visualized
and photographed under UV light using a Gel Documentation System (Syngene, USA). The lane
'M' shown in the photograph represents the molecular weight marker (Marker XIII,
Boehringer Mannheim). Lane 1 shows the PCR amplicon (472 bp) obtained from 'adil.flesh'
using primers 'mcb398' and 'mcb869'. The PCR amplicon obtained was sequenced on both
strands (ABI Prism 3700 DNA Analyzer, PE-Applied Biosystems). The chromatogram shows the
sequence (about 80 bp long, i.e. between positions 150 and 230 of the 328 bp sequence
revealed from the PCR product of 472 bp length) obtained from 'adil.flesh'.

Figure 1b. Illustrates the further steps involved in the analyses. The sequence (328 bp)
revealed from 'adil.flesh' was subjected to a homology search in the nr (i.e.
non-redundant) database of the National Center for Biotechnology Information (NCBI), USA.
The sequences producing significant alignments are shown along with their bit scores and
E values. These indicate the extent of homology between the query sequence (i.e. the
328 bp sequence from 'adil.flesh') and the sequences registered in the nr database of
NCBI. BLAST analysis revealed the highest homology of the sequence from 'adil.flesh' with
the sequence of Panthera pardus (GenBank registration number 'AY005809'), indicating the
identity of 'adil.flesh' as that of a leopard (Panthera pardus) origin. Figure 1b further
illustrates the multiple alignment of the sequences obtained from the reference animals
(listed in Table 5) along with the sequence obtained from 'adil.flesh'. The sequence of
'adil.flesh' is similar to the sequence of 'gz1L', further confirming the identity of the
source of the confiscated remains 'adil.flesh' as that of a Panthera pardus origin.

Figure 1c. Illustrates the NJ-tree (Neighbor Joining tree) constructed using CLUSTAL X
(1.8) from the sequences revealed from 'adil.flesh' and the reference animals listed in
Table 5. Animals belonging to the same species cluster together, whereas animals of
different species group in different clusters. The confiscated material under
investigation (i.e. 'adil.flesh') clusters with 'gz1L' (i.e. the known normal leopard
'Panthera pardus'), indicating the identity of the species of 'adil.flesh' as that of a
Panthera pardus source.

Figure 2. Shows the agarose gel electrophoretogram of the PCR amplicons (472 bp) obtained
from the reference animals of family Felidae listed in Table 5, using universal primers
'mcb398' and 'mcb869'. Description of different lanes is as follows:
Lanes 1-21: The PCR profiles of the animals 1-21, respectively, listed in Table 5
Lane 22: The PCR profile of DNA isolated from the confiscated skin of unknown animal
origin, i.e. 'adil.flesh'
Lane 23: Negative control (no DNA)
Lane M: Molecular weight marker (Marker XIII, Boehringer Mannheim)

Figure 3. Shows PCR amplicons obtained from the animals listed in Table 9. The primers
used in PCR are 'AFF' and 'AFR'. The description of different lanes shown is as follows:
Lanes 1-4: The PCR profiles of animals 1-4, respectively, listed in Table 9, showing
amplicons of 354 bp
Lane M: Molecular weight marker (Marker XIII, Boehringer Mannheim)

Figure 4. Shows PCR amplicons obtained from the animals listed in Table 12. This
experiment demonstrates the universal nature of our primers among a vast range of animal
species. Description of different lanes shown is as follows:
Lanes 1-23: The PCR profiles of the animals 1-23, respectively, listed in Table 12. The
PCR product of 472 bp is amplified universally from all the animal species analyzed.
Lane 24: Negative control (no DNA)
Lane M: Molecular weight marker (Marker XIII, Boehringer Mannheim)

Table 1. List of 221 animal species used for in-silico analysis to design the universal
primers 'mcb398' and 'mcb869'. The table also shows the 'P,S scores' of 'mcb398' and
'mcb869' for different templates. The descriptions of the various symbols used in this
table are as follows:

Symbol (#) refers to Number.
Symbol (*) refers to an animal species which is either a protected species (listed in the
Wildlife (Protection) Act, 1972 (Central Act No. 53 of 1972)) or an endangered/rare
animal species.
Symbol ($P,S/F) refers to the Probability of match and Stability of match of primer
'mcb398' with different templates (i.e. the cytochrome b gene of different species). A
higher P,S score refers to a higher probability of significant amplification of the
specific template by the primer. It is calculated by Amplify (1.2) software.
Symbol (P,S/R) refers to the Probability of match and Stability of match of primer
'mcb869' with different templates. A higher P,S score refers to a higher probability of
significant amplification of the specific template by the primer. It is calculated by
Amplify (1.2) software.

Table 2. Multiple sequence alignment of the 472 bp fragment of the mitochondrial
cytochrome b gene (identified by the inventors to fulfill the requirements of columns 1, 2
and 3 mentioned under sub-heading 'Objectives of invention') of the 221 animal species
listed in Table 1. The alignments also show the binding sites for universal primers
'mcb398' and 'mcb869'. The symbol (*) refers to the nucleotide bases which are conserved
amongst the 221 animal species listed in Table 1. The alignments were done using the
software CLUSTAL X (1.8). The nucleotide positions that are unmarked are variable amongst
the 221 animal species analyzed. These variable sites together constitute the molecular
signature of an individual species, giving rise to the molecular basis of species
identification by our primers.

Table 3. Results of the BLAST analysis of the sequence revealed from 'adil.flesh' in the
'mito' database of NCBI. It shows the most significant alignment of the cytochrome b
sequence (328 bp) revealed from the confiscated skin piece 'adil.flesh' with the Felis
catus cytochrome b gene sequence (GenBank registration number NC_001700.1, bit score 365,
E value e-101) registered in the NCBI database. It gives an indication that the species of
the analyzed material belongs to family Felidae. It also fulfills the requirements of
column 6 mentioned above under sub-heading 'Objectives of invention'.

Table 4. Results of the BLAST analysis of the sequence revealed from 'adil.flesh' in the
'nr' database of NCBI. It shows the most significant alignment of the cytochrome b
sequence (328 bp) revealed from the confiscated skin piece 'adil.flesh' with the Panthera
pardus cytochrome b gene sequence (GenBank registration number AY005809, bit score 603,
E value e-170) registered in the NCBI database. It gives an indication that the analyzed
material is of Panthera pardus origin. It also fulfills the requirements of column 6
mentioned above under sub-heading 'Objectives of invention'.

Table 5. Reference animals belonging to family Felidae selected for comparison with
'adil.flesh' to confirm the findings of the BLAST analyses, the results of which are
given in Tables 3 and 4, respectively. The animals listed in SN. 1-21 represent different
species of family Felidae. SN. 22 and 23 are primate species taken for out-group
comparisons.

Table 6. Multiple sequence alignment of the cytochrome b sequences (328 bp) revealed from
'adil.flesh' and the reference animals listed in Table 5. The positions that have a common
nucleotide in all the animal species under investigation are shown with a star (*) mark,
whereas the positions that are variable in any of the animals under investigation are
unmarked. The nucleotides at these positions constitute the molecular signature of an
individual species, which is unique and highly specific for its species. These signatures
are the molecular basis of identification of individual animal species using our primers
'mcb398' and 'mcb869'.

Table 7 (Tables 7a, 7b, 7c and 7d). Comparison of the molecular signatures of the
different animal species investigated along with 'adil.flesh', the confiscated skin of
unknown animal origin. This table shows the variable positions (i.e. the positions which
are not marked with the star (*) symbol in Table 6) within the 328 bp fragment revealed
from the animals listed in Table 5. The dot (.) mark represents the presence of the same
nucleotide as listed in row 1, i.e. the sequence from 'adil.flesh', at that position. It
demonstrates that the signature of each species is unique and specific to its species.
The molecular signature of 'adil.flesh' is comparable (except for position 37, which has
a transition from 'T' to 'C') to the molecular signature of 'gz1L', i.e. the known leopard
'Panthera pardus' source, indicating the identity of the source of the confiscated skin
'adil.flesh' as that of a leopard 'Panthera pardus' source. The nucleotide variations (at
positions 153, 198, 223 and 264) among the known leopards (i.e. gz1L, gz2L and gz3L,
respectively) give an idea about the geographical habitat of each animal. Various studies
on the molecular evolution of different animal species support this hypothesis [75];
however, it could be further confirmed by taking reference animals from different
geographical areas and analyzing them with our primers 'mcb398' and 'mcb869'. If a
database of the different haplotypes (i.e. habitat-specific molecular signatures) of the
animal species could be generated, it would also enable our primers to reveal the
geographical location where a wildlife crime was committed.
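
The dot notation described for Table 7 can be sketched in Python as follows. The function name `variable_sites` and the three toy sequences are hypothetical and serve only to illustrate how variable columns are extracted and how identity to the first row is shown with a dot; they are not data from the tables.

```python
def variable_sites(aligned: dict[str, str]) -> tuple[list[int], dict[str, str]]:
    """Return the 1-based variable columns and each row written in dot notation."""
    names = list(aligned)
    seqs = [aligned[name] for name in names]
    if len({len(seq) for seq in seqs}) != 1:
        raise ValueError("aligned sequences must be non-empty and of equal length")
    # A column is variable when more than one distinct nucleotide occurs in it.
    positions = [i for i, col in enumerate(zip(*seqs), start=1) if len(set(col)) > 1]
    first = seqs[0]
    table = {names[0]: "".join(first[p - 1] for p in positions)}
    for name, seq in zip(names[1:], seqs[1:]):
        # A dot means "same nucleotide as the first row at this position".
        table[name] = "".join(
            "." if seq[p - 1] == first[p - 1] else seq[p - 1] for p in positions
        )
    return positions, table


# Toy alignment; the first entry plays the role of the unknown sample.
toy = {
    "query": "acgtacgtac",
    "ref_a": "acgcacgtac",
    "ref_b": "atgtacgtgc",
}
positions, table = variable_sites(toy)
print(positions)
for name, row in table.items():
    print(f"{name:6s} {row}")
```

A reference row consisting mostly of dots is a close match to the first row, which is how the table is read by eye.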

Table 8. Percent similarity matrix calculated by pair-wise comparison of the aligned
nucleotide sequences (illustrated in Table 6). The cytochrome b gene sequence of the DNA
isolated from the confiscated material had maximum similarity (99.7% and 98.2% with the
lineages of animals 'gz2L' and 'gz3L', respectively) with the sequences obtained from
known normal leopard sources, indicating its identity as that of a leopard origin. The
similarity matrix was calculated using the software PHYLIP (3.5).
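
A pair-wise percent similarity of the kind reported in Table 8 can be approximated as the share of identical columns between two aligned sequences. The sketch below assumes this simple identity measure, which may differ from the exact calculation performed by PHYLIP; the function names and the toy 328-column alignment are hypothetical. Under this assumption, one differing column out of 328 corresponds to 327/328, or about 99.7%, and six differing columns to 322/328, or about 98.2%.

```python
def percent_identity(a: str, b: str) -> float:
    """Percentage of aligned columns at which two sequences are identical."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be aligned to the same non-zero length")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)


def similarity_matrix(aligned: dict[str, str]) -> dict[tuple[str, str], float]:
    """Pair-wise percent identity for every ordered pair, rounded to one decimal."""
    names = list(aligned)
    return {
        (p, q): round(percent_identity(aligned[p], aligned[q]), 1)
        for p in names
        for q in names
    }


# Toy 328-column alignment (hypothetical data): 'ref_a' differs from 'query'
# at one column and 'ref_b' differs from 'query' at six columns.
query = "a" * 328
toy = {
    "query": query,
    "ref_a": "c" + query[1:],
    "ref_b": "c" * 6 + query[6:],
}
matrix = similarity_matrix(toy)
print(matrix[("query", "ref_a")], matrix[("query", "ref_b")])
```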

Table 9. Animals selected for validation of the minimum P,S score required for efficient
amplification of the cytochrome b gene of different origins by the primers 'mcb398' and
'mcb869'. The P,S scores of primers 'AFF' and 'AFR' for these animals are shown.

Table 10. BLAST analysis of primer 'mcb398' in the nr database of NCBI. It demonstrates
that the 3' end of this primer is highly conserved among a vast range of animal species.
It also shows the significant homology between the primer and the templates (i.e. the
cytochrome b gene fragment of different animal species), confirming the universal nature
of our primer.

WO 02/077278 PCT/IN01/00055

Table 11. BLAST analysis of primer 'mcb869' in the nr database of NCBI. It demonstrates
that the 3' end of this primer is highly conserved among a vast range of animal species.
It also shows the significant homology between the primer and the templates (i.e. the
cytochrome b gene fragment of different animal species), confirming the universal nature
of our primer.

Table 12. Other animals belonging to distantly related animal species, investigated to
confirm the universal nature of primers 'mcb398' and 'mcb869'. The gel photograph showing
the PCR amplicons from these animals is shown in Figure 4.

The mitochondrial cytochrome b gene has been used very widely in molecular taxonomic
studies. It has immense capability to reveal different evolutionary lineages of animals in
a family-, genus- and species-specific manner. It has also been used to classify the
populations of a particular species according to their demographic distributions [75]. A
vast database of cytochrome b sequences of different animal species has accumulated in
public databases such as GenBank and NCBI [1-65]. We have explored these unique
characteristics of the cytochrome b gene to establish the identity of confiscated remains
of any unknown animal by inventing a pair of novel primers, 'mcb398' and 'mcb869', that
can amplify a small fragment (472 bp) of the cytochrome b gene of a wide range of animal
species in a universal manner. These primers work universally because their 3' ends target
a highly conserved region.

The fragment of the cytochrome b gene identified had all the features mentioned in columns
1, 2 and 3 listed under sub-heading 'Objectives of invention'. We identified this fragment
by aligning the cytochrome b gene sequences (1140 bp) of the 221 different animal species
listed in Table 1. These sequences are publicly available in the NCBI DNA databases and
were aligned using the software CLUSTAL X (1.8). As mentioned before, the 472 bp fragment
of the cytochrome b gene identified by us as having the features mentioned in columns 1, 2
and 3 listed under sub-heading 'Objectives of invention' includes the nucleotides between
398 and 869 in Antilope cervicapra and Felis catus, and between 399 and 870 in Homo
sapiens sapiens. Except at a few positions (marked with a star (*) in Table 2), the
nucleotide sequences of this fragment are highly variable amongst animal species, which
allows the identity of biological material of unknown animal origin to be revealed by the
procedure invented by us. As for the identity of this fragment, we are considering
Antilope cervicapra as a representative species, and the sequence of the above fragment of
the cytochrome b gene of Antilope cervicapra is mentioned herewith:

Mitochondrial cytochrome b gene sequence (398-869 bp) of Antilope cervicapra:
"taccatgaggacaaatatctttttgaggagcaacagtcatcaccaatctcctttcagcaatcccatacatcggtacaaacctagtaga
atgaatctgaggagggttctcagtagataaagcaacccttacccgatttttcgccttccactttatcctcccatttatcattgcagccctt
accatagtacacctactgtttctccacgaaacaggatccaacaaccccacaggaatctcatcagacgcagacaaaattccattccac
ccctactacactatcaaagatatcctaggagctctactattaattttaaccctcatgcttctagtcctattctcaccggacctgcttggag
acccagacaactatacaccagcaaacccacttaatacacccccacatatcaagcccgaatgatacttcctatttgcatacgcaatcct
ccgatcaattcctaacaaactaggagg"

Table 2 presents the alignment of the above fragment of the cytochrome b gene of 221
animal species. Each species in Table 2 is represented by a unique code, which is decoded
in Table 1. We selected these species to represent a vast range of animal families of
distant orders. Of the 221 species, about 65 were protected/endangered or rare species
listed in the Wildlife (Protection) Act, 1972 (Central Act No. 53 of 1972). These species
are marked with the symbol (*) in Table 1. The NCBI accession number refers to the
registration number in the NCBI database, and the number in superscript represents the
reference cited. Based on the aligned cytochrome b sequences of the 221 different animal
species, the primers designed were as follows:

Primer name      Sequence (5'-3')

'mcb398'         "TACCATGAGGACAAATATCATTCTG"
'mcb869'         "CCTCCTAGTTTGTTAGGGATTGATCG"
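
As an illustration of how a primer can be compared with its binding site, the following Python sketch lines up each primer with the corresponding end of the Antilope cervicapra fragment quoted in this description. The reverse primer is compared against the reverse complement of the last 26 nucleotides of the fragment. The helper names are hypothetical, and the reported positions depend on the sequences exactly as transcribed in this text, so the result should be read as an illustration rather than as a statement about the original sequence records.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of an unambiguous DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def mismatches(primer: str, site: str) -> list[int]:
    """1-based primer positions (counted 5' to 3') that differ from the site."""
    if len(primer) != len(site):
        raise ValueError("primer and binding site must have the same length")
    pairs = zip(primer.upper(), site.upper())
    return [i for i, (p, s) in enumerate(pairs, start=1) if p != s]


MCB398 = "TACCATGAGGACAAATATCATTCTG"
MCB869 = "CCTCCTAGTTTGTTAGGGATTGATCG"

# First 25 and last 26 nucleotides of the Antilope cervicapra fragment,
# copied from the sequence as printed in this description.
fragment_start = "taccatgaggacaaatatctttttg"
fragment_end = "cgatcaattcctaacaaactaggagg"

forward_mm = mismatches(MCB398, fragment_start)
reverse_mm = mismatches(MCB869, reverse_complement(fragment_end))
print("mcb398 mismatch positions:", forward_mm)
print("mcb869 mismatch positions:", reverse_mm)
```

Because the primers were designed from an alignment of many species, a few internal differences from any single species, including the representative one, are to be expected; what matters for priming is that the bases at the 3' end match.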
Tables 2, 10 and 11 demonstrate that the 3' ends of the primers are highly conserved
amongst all the animal species analyzed in-silico (in total, 221 animal species listed in
Table 1 and about 500 species listed in Tables 10 and 11). Also, the 5' ends of the
primers were selected within the conserved region of the cytochrome b gene to improve the
probability and stability of match of the primers to their target sequences (i.e. the
above-mentioned 472 bp fragment of the cytochrome b gene). The primers were thoroughly
checked for internal stability and for loop or dimer formation using different software,
viz. 'Amplify (1.2)' and 'Primer3'
(http://www.genome.wi.mit.edu/cgi-bin/primer/primer3.cgi), as well as manually.

We assigned a P,S score (P = Probability of match, S = Stability of match) to the primers
for each template using the software Amplify (1.2). Higher P and S scores ensure good
amplification if all other standard conditions (which are mentioned under 'Example 3') are
optimum. The highest score for 'mcb398' was 98, 63 (i.e. the situation where the primer
matches the template perfectly), whereas the highest P,S score for 'mcb869' was 93, 68
for a complete match between the primer and template. The lowest P,S score observed for
'mcb398' was 81, 50 for the species Talpa europaea, whereas 'mcb869' had a high P,S score
for this species (92, 57). The other species with the lowest P,S scores for one of the two
primers were Eumeces egregius and Equus asinus. Eumeces egregius had P,S scores of 86, 55
and 73, 51 for 'mcb398' and 'mcb869', respectively, and Equus asinus had P,S scores of
91, 61 and 73, 51 for 'mcb398' and 'mcb869', respectively. All other animals had higher
P,S scores than the above-mentioned species.

To ensure that these primers would work efficiently with DNA templates from the animals
having the lowest P,S score for one of the primers, we designed another experiment to
validate the lower limit of the P,S score of one of the two primers that is sufficient for
efficient amplification in PCR. We designed another primer pair
(AFF = 5'tagtagaatgaatctgaggagg3' and AFR = 5'atgcaaataggaagtatcattc3') having more
mis-pairing at their annealing sites (but not at the ends), and therefore less internal
stability and lower P,S scores for their templates (listed in Table 9). The P,S scores of
'AFF' and 'AFR' were calculated to be as low as 41 and 49 for Platanista gangetica and Sus
scrofa. These species were amplified efficiently using the primers 'AFF' and 'AFR'
(results shown in Figure 3), keeping all other conditions standard (i.e. the conditions
mentioned in 'Example 3'). The lowest P,S scores for our primers 'mcb398' and 'mcb869'
(86, 55 and 73, 51, respectively, for the species Eumeces egregius) were higher than the
above range of combined P,S scores of 'AFF' and 'AFR' for the species Sus scrofa (87, 52
and 87, 41), which was efficiently amplified by the primers 'AFF' and 'AFR'. This
indicates that the primers 'mcb398' and 'mcb869' would work efficiently with all the
species, including Eumeces egregius, to give the expected product in PCR. This experiment
confirmed that the primers 'mcb398' and 'mcb869' are capable of amplifying the cytochrome
b fragment of most animal species in a universal manner.

For further confirmation of the universal nature of our primers, we searched the sequences
of our primers against the mito and nr databases of NCBI using the BLAST software. The
results of these analyses are shown in Tables 10 and 11, respectively.

Finally, the universal nature of the primers was tested in our laboratory with some more
animal species, listed in Table 12. These primers amplified DNA from all the animal
species efficiently, giving rise to a band of the expected size (472 bp). The results are
shown in Figure 4. This experiment substantiated the results of the P,S analysis and the
other in-silico analyses, showing that the primers 'mcb398' and 'mcb869' are universal
primers.

The flow chart of establishing identity of the species of biological material of unknown
animal origin using primers 'mcb398' and 'mcb869'

Biological material of unknown animal origin

↓

DNA isolation

↓

PCR amplification of the isolated DNA using primers 'mcb398' and 'mcb869'

↓

Sequencing of both strands in triplicate (using any standard sequencing procedure, such
as the ABI Prism 3700, PE-Applied Biosystems)

↓

BLAST of the revealed sequence in the mito database of NCBI
(http://www.ncbi.nlm.nih.gov/BLAST)
(it gives an idea of the family of the analyzed material by producing the most significant
alignment of the query sequence with the sequences registered in the database)

↓

BLAST of the revealed sequence in the nr database of NCBI
(http://www.ncbi.nlm.nih.gov/BLAST)
(it gives an idea of the genus or, more precisely, the species of the analyzed material by
producing the most significant alignment of the query sequence with the sequences
registered in the database)

↓

Selection of reference animals belonging to the family, genus and species revealed by the
mito and nr BLAST searches

↓

Isolation of DNA from the blood of known reference animals; PCR amplification using
primers 'mcb398' and 'mcb869'; sequencing of the PCR products in triplicate using the same
primers

↓

Multiple sequence alignment of the revealed mitochondrial cytochrome b gene sequences of
the known reference animals and the biological material of unknown animal origin, using
software such as Autoassembler/CLUSTAL X (1.8)

↓

Identification, among the aligned sequences, of the sequence that is homologous (or
significantly similar) to the cytochrome b gene sequence of the DNA obtained from the
biological material of unknown animal origin

↓

The species of the homologous sequence would be the species of the biological material
under investigation



WO 02/077278 PCT/IN01/00055




Examples

Example 1

Example for identification of a fragment of the cytochrome b gene fulfilling the requirements
of columns 1, 2 and 3 mentioned under sub-heading 'Objectives of invention' of heading
'Brief summary of invention'

The cytochrome b molecule has been used very widely in molecular taxonomic studies.
Being a slowly evolving gene, it carries a tremendous amount of information in its nucleotide
sequence to distinguish animals to their family, genus and species sources (1-65). A vast database of
the sequences of the cytochrome b gene of different animal species has accumulated in the nr
and mito databases of NCBI. We have explored these qualities of the cytochrome b gene to
establish the identity of confiscated remains of unknown animal origin to its family, genus
and species sources. For this purpose, we have identified a fragment of the cytochrome b gene
that is highly polymorphic between species but monomorphic among individuals of the
same species; it can therefore group an individual of an unknown species
with the known individuals of the reference species to which it belongs. In order to amplify
this fragment from DNA isolated from any unknown origin, it was necessary that it be
flanked by sequences that are highly conserved amongst a vast range of animal families. To
identify such a unique fragment within the cytochrome b gene, we aligned the sequences of
221 distantly related animal species (listed in Table 1) representing various families using
the software CLUSTAL X (1.8). These sequences were obtained from the public database NCBI
(http://www.ncbi.nlm.nih.gov). The aligned data were examined carefully for sites
conserved amongst all the species included in the in-silico analysis. We identified a fragment (472
bp) of the cytochrome b gene that fulfilled all the requirements mentioned above and also
those under columns 1, 2 and 3 of sub-heading 'Objectives of invention'.
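
The search for conserved flanking sites described above can be illustrated with a minimal sketch. It assumes the sequences have already been aligned to equal length (for example by CLUSTAL X) and marks each fully conserved column with a star, in the manner of the star line referred to in Table 2. The function names, the toy alignment and the minimum run length are illustrative assumptions and are not part of the invention.

```python
def conservation_line(aligned):
    """Return a string with '*' under every column where all sequences agree."""
    if not aligned:
        raise ValueError("alignment is empty")
    length = len(aligned[0])
    if any(len(seq) != length for seq in aligned):
        raise ValueError("aligned sequences must have equal length")
    marks = []
    for column in zip(*aligned):
        bases = {base.upper() for base in column}
        # A column is conserved only if one base occurs and it is not a gap.
        marks.append("*" if len(bases) == 1 and "-" not in bases else " ")
    return "".join(marks)


def conserved_runs(marks, min_len):
    """Return (start, end) index pairs of runs of '*' at least min_len long."""
    runs = []
    start = None
    for i, mark in enumerate(marks + " "):  # trailing sentinel closes a final run
        if mark == "*" and start is None:
            start = i
        elif mark != "*" and start is not None:
            if i - start >= min_len:
                runs.append((start, i))
            start = None
    return runs


# Hypothetical three-sequence alignment, for illustration only.
toy_alignment = [
    "ATGGCCAAGTTCGA",
    "ATGGCTAAATTCGA",
    "ATGGCGAACTTCGA",
]
marks = conservation_line(toy_alignment)
runs = conserved_runs(marks, min_len=5)
```

In this toy case the two conserved runs flank a variable stretch, which is the arrangement sought for primer annealing sites on either side of a polymorphic fragment.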

As for the identity of this fragment, it includes the nucleotides between 398 and 869 in
Antilope cervicapra and Felis catus, and between 399 and 870 in Homo sapiens sapiens.
Except at a few positions marked with a star (*) in Table 2, the nucleotide sequences of
this fragment are highly variable amongst the animal species, giving rise to a unique
molecular signature for each. These molecular signatures are characteristic of each species
and form the basis of revealing the identity of biological material of an unknown animal
origin by the procedure invented by us. Considering Antilope cervicapra as a representative
species, the sequence of this fragment is given herewith:

Mitochondrial cytochrome b gene sequence (398-869 bp) of Antilope cervicapra
"taccatgaggacaaatatctttttgaggagcaacagtcatcaccaatctcctttcagcaatcccatacatcggtacaaacctagtaga
atgaatctgaggagggttctcagtagataaagcaacccttacccgatttttcgccttccactttatcctcccatttatcattgcagccctt
accatagtacacctactgtttctccacgaaacaggatccaacaaccccacaggaatctcatcagacgcagacaaaattccattccac
ccctactacactatcaaagatatcctaggagctctactattaattttaaccctcatgcttctagtcctattctcaccggacctgcttggag
acccagacaactatacaccagcaaacccacttaatacacccccacatatcaagcccgaatgatacttcctatttgcatacgcaatcct
ccgatcaattcctaacaaactaggagg"
Example 2:

Example for development of universal primers to amplify the fragment identified under
'Example 1'

A pair of universal primers was designed with the following features:

1. It targets the fragment identified (mentioned under 'Example 1') to amplify it in the
polymerase chain reaction (PCR).

2. Its 3' and 5' ends are highly conserved (marked with a star (*) in Table 2)
amongst a vast range of animal species, ensuring the amplification of the fragment
mentioned above in a universal manner. The sequencing of the fragment amplified
by these primers reveals the molecular signature of the species of the analyzed material,
which on comparison with the sequences of known reference animals reveals
the identity of the species of the unknown biological material under investigation.

3. The Tm (melting temperature) of both primers is almost the same (about 58 degrees
centigrade), ensuring significant annealing of both primers to the template and
therefore significant amplification of the targeted region in PCR.

4. High internal stability and high P, S scores of the primers were ensured while
designing them. The possibilities of internal loop formation, dimer formation, etc.
were also excluded by selecting their sequences uniquely. This ensured that the primers
would be good primers for use in PCR for amplification of DNA of
unknown animal origin.

5. The 3' ends of the primers were ensured to have either 'G' or 'C' to increase the
probability of strong bonding at the 3' ends, which is necessary for efficient
amplification of the DNA template in PCR. This also strengthens the universal nature of
the primers.

6. The sequences of the primers were ensured to be unique so that they do not give rise
to non-specific and spurious products in PCR, which would lead to confusion. This improved the
efficiency and quality of the technique invented by us.

7. These primers were named 'mcb398' and 'mcb869' because of their property of
amplifying a region of the mitochondrial cytochrome b gene between nucleotides 398 and
869 of Antilope cervicapra, a representative animal species for this invention. We
took this animal species as the representative species because the idea of developing
such novel primers came to the inventors while they were working on the
genome of this animal at the Centre for Cellular and Molecular Biology, Hyderabad,
India.

8. The sequences of the universal primers invented are as follows:

Primer name    Sequence (5'-3')
'mcb398'       "TACCATGAGGACAAATATCATTCTG"
'mcb869'       "CCTCCTAGTTTGTTAGGGATTGATCG"
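
Some of the listed features can be checked directly against the sequences given in Examples 1 and 2. The sketch below, offered only as an illustration, compares each primer with the corresponding end of the Antilope cervicapra fragment printed in Example 1, checks the 3' terminal base, and derives the expected product length from the nucleotide positions. The variable and function names are assumptions made for this sketch, and the two fragment ends are copied from the sequence as printed above.

```python
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


def mismatches(a, b):
    """Count positions at which two equal-length sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must have equal length")
    return sum(1 for x, y in zip(a.upper(), b.upper()) if x != y)


# Primer sequences as listed in Example 2.
MCB398 = "TACCATGAGGACAAATATCATTCTG"
MCB869 = "CCTCCTAGTTTGTTAGGGATTGATCG"

# First 25 and last 26 bases of the Antilope cervicapra fragment in Example 1.
FRAGMENT_5P_END = "taccatgaggacaaatatctttttg"
FRAGMENT_3P_END = "cgatcaattcctaacaaactaggagg"

# The forward primer is compared directly with the start of the fragment;
# the reverse primer anneals to the opposite strand, so its reverse
# complement is compared with the end of the fragment.
forward_mismatches = mismatches(MCB398, FRAGMENT_5P_END)
reverse_mismatches = mismatches(reverse_complement(MCB869), FRAGMENT_3P_END)

# Feature 5: each primer should end in G or C at its 3' end.
three_prime_gc = all(primer[-1] in "GC" for primer in (MCB398, MCB869))

# Expected product length from nucleotide 398 to nucleotide 869 inclusive.
amplicon_length = 869 - 398 + 1
```

Against the representative sequence as printed, the primers differ at a small number of internal positions while matching at their 3' ends.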
Example 3:

Example for development of universal PCR conditions to ensure the amplification of a
template of any unknown origin in PCR, hence strengthening the universal nature of the
technique invented by us

The PCR conditions developed had the following unique features:

1. These were capable of amplifying the DNA template of any animal origin in a
universal manner using the universal primers mentioned under 'Example 2'.

2. The conditions were selected to ensure a comparable annealing temperature for
both the primers, i.e. 'mcb398' and 'mcb869'.

3. The PCR conditions standardized herewith are universal; therefore, the
possibility of PCR failure with a template of unknown origin due to non-standard
conditions is excluded. This ensures the universal nature of our technique for use in
wildlife forensics.

4. The universal conditions mentioned above are:

Amplification reactions should be carried out in a 20 µl reaction volume containing
approximately 20 ng of template DNA, 100 µM each of dNTPs, 1.25 pmole of each primer,
1.5 mM MgCl2, 0.5 unit of AmpliTaq Gold (Perkin-Elmer-Cetus, USA) DNA polymerase
and 1X PCR buffer (10 mM Tris-HCl, pH 8.3, and 50 mM KCl). The amplification profile
followed should be: an initial denaturation at 95°C for 10 min, followed by 35 cycles each
of denaturation at 95°C for 45 s, annealing at 51°C for 1 min, and extension at 72°C for 2
min. The extension step of the 35th cycle should be held for 10 min.
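
The arithmetic implied by these conditions can be sketched as follows. The example converts the primer amount into a final concentration and adds up the programmed hold times of the amplification profile. The function names are illustrative assumptions, and ramping time between temperatures, which depends on the thermal cycler, is deliberately ignored.

```python
def primer_concentration_nm(primer_pmol, reaction_volume_ul):
    """Final primer concentration in nM (1 pmol per µl equals 1 µM)."""
    return primer_pmol / reaction_volume_ul * 1000.0


def programmed_minutes(cycles=35, initial_denaturation_min=10.0,
                       denaturation_s=45, annealing_s=60, extension_s=120,
                       final_extension_min=10.0):
    """Sum of programmed hold times, with the last cycle's extension prolonged."""
    regular_cycle_min = (denaturation_s + annealing_s + extension_s) / 60.0
    last_cycle_min = (denaturation_s + annealing_s) / 60.0 + final_extension_min
    return (initial_denaturation_min
            + (cycles - 1) * regular_cycle_min
            + last_cycle_min)


# Values taken from Example 3: 1.25 pmole of each primer in a 20 µl reaction.
primer_nm = primer_concentration_nm(1.25, 20.0)
total_min = programmed_minutes()
```

With the stated values, each primer is present at 62.5 nM and the programmed holds add up to just under two and a half hours.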
Example 4:

Establishing the universal nature of our primers, and experimental evidence to demonstrate
the universal nature of the primers:

The universal nature of the primers 'mcb398' and 'mcb869' was ensured by the following
measures:

(a) Selecting the primers from the aligned cytochrome b gene sequences of 221 distantly
related animal species:

The cytochrome b gene sequences (1140 bp) were aligned using the software CLUSTAL X
(1.8). The region of the cytochrome b gene that was most conserved amongst the 221 animal
species was selected to design the primers.

(b) Selecting the 3' and 5' ends of the primers at highly conserved positions of the
cytochrome b gene:

The 3' and 5' ends of the primers were ensured to anneal to highly conserved positions
amongst 221 animal species representing a vast range of animal families. This was done to
ensure efficient amplification of all the species in PCR. These positions are shown with a
star (*) mark in Table 2.

(c) Ensuring either 'G' or 'C' at the 3' end of the primers:

The primers were ensured to have either 'G' or 'C' at their 3' ends, as these
nucleotides bond strongly at the 3' ends of the primers owing to the three
hydrogen bonds formed when they pair with each other. The strong bonding at the 3' ends helps the
primers to anneal properly with their template, resulting in significant amplification in PCR.

(d) Selecting the sequences of the primers to ensure a higher internal stability, a higher P, S
score, and no primer dimer or loop formation:

The sequences of the primers were selected to have a high P, S score for a vast range of
animal species (shown in Table 1). Care was taken to exclude the possibilities of loop
or primer dimer formation that could reduce the efficiency of the primers in PCR.

(e) Selecting the sequences of the primers with a comparable melting temperature:

The sequences of the primers were selected to have a comparable melting temperature so
that they could work together to amplify a DNA template in PCR at a similar annealing
temperature. The melting temperature of both the primers was about 58 degrees centigrade
and the annealing temperature used in PCR is 51 degrees centigrade.

Experimental evidence to demonstrate the universal nature of the primers:
(1) Evidence from in-silico analysis:
(a) Selecting the primers within the most conserved region of the mitochondrial cytochrome b
gene

As mentioned above, the primers were designed to anneal within a highly conserved region
of the mitochondrial cytochrome b gene fragment of 472 bp. Table 2 presents the alignment of
the above fragment of the cytochrome b gene of 221 animal species representing a vast range
of animal families. The conserved positions of the nucleotide sequences are shown with a star
(*) mark in Table 2.

Table 2 also demonstrates that the 3' ends of the primers are highly conserved amongst all
the animal species analyzed in-silico. In the aligned sequences, the conserved nucleotides
are marked with the symbol (*). Also, the 5' ends of the primers were selected within the
conserved region of the cytochrome b gene to improve the probability and stability of match of
the primers to their target sequences (i.e. the above-mentioned 472 bp fragment of the
cytochrome b gene). The primers were thoroughly checked for internal stability and for loop or
dimer formation using different software, viz. 'Amplify (1.2)' and 'Primer 3'
(http://www.genome.wi.mit.edu/cgi-bin/primer/primer3.cgi), as well as manually.

(b) P, S score analysis:

We assigned the P, S score (P = probability of match, S = stability of match) to the primers
for each template using the software Amplify (1.2). Higher scores of P and S ensure
good amplification if all other conditions (which are mentioned under 'Example
3') are optimum. The highest score for 'mcb398' was 98, 63 (i.e. the situation where the
primer has a perfect match with the template), and the highest P, S score for 'mcb869' was
recorded as 98, 68 for a complete match between the primer and template. The lowest P, S
score observed for 'mcb398' was 81, 50 for the species Talpa europaea, whereas 'mcb869' had
a high P, S score for this species (92, 57). The other species that had the lowest P, S
score for one of the two primers were Eumeces egregius and Equus asinus. Eumeces
egregius had P, S scores of 86, 55 and 73, 51 for 'mcb398' and 'mcb869', respectively,
and the P, S scores of Equus asinus were calculated as 91, 61 and 73, 51 for 'mcb398'
and 'mcb869', respectively. All other animals had higher P, S scores than the
above-mentioned species. To ensure that these primers would work efficiently with the DNA
template from the animals having the lowest P, S score for one of the primers, we designed
another experiment to validate the lower limit of the P, S score of one of the two primers
that is sufficient for efficient amplification in PCR. We designed another primer pair
(AFF = 5' ctagtagaatgaatctgaggagg 3' and AFR = 5' tatgcaaataggaagtatcattc 3') that has more
mis-pairing at its annealing sites (but not at the ends), and therefore has less internal stability
and lower P, S scores for its templates (listed in Table 9). The P, S scores of 'AFF' and
'AFR' were calculated to be as low as 41 and 49 for Platanista gangetica and Sus scrofa.
These species were amplified efficiently using the primers 'AFF' and 'AFR' (results
shown in Figure 3), keeping all other conditions standard, i.e. the conditions mentioned in
'Example 3'. The lowest P, S scores (86, 55 and 73, 51 for species Eumeces egregius) for
our primers 'mcb398' and 'mcb869', respectively, were higher than the above range of
combined P, S scores of 'AFF' and 'AFR' for species Sus scrofa (87, 52 and 87, 41),
which was efficiently amplified by the primers 'AFF' and 'AFR'. This indicates
that the primers 'mcb398' and 'mcb869' would work efficiently with all the species, including
Eumeces egregius, to give rise to the expected product in PCR. This
experiment confirmed that the primers 'mcb398' and 'mcb869' are capable of amplifying
the cytochrome b fragment of most animal species in a universal manner.
(c) BLAST analysis:

The sequences of primers 'mcb398' and 'mcb869' were blasted against the mito and nr
databases of NCBI to find their significant alignments with the sequences registered in
GenBank. As expected, the most significant alignments of the sequences were found with
the cytochrome b gene regions (within the 472 bp fragment mentioned in 'Example 1') of
different animal species. This analysis also showed that the 3' as well as the 5' ends of the
primers were highly conserved amongst a vast range of animal species, confirming the
universal nature of the primers (Tables 10 and 11, respectively).
(2) Evidence from bench work/experiments done in laboratory conditions:

The DNA from different animals belonging to distantly related species (mentioned in
Table 12) was isolated and subjected to PCR amplification using the primers invented by
us, i.e. the primers 'mcb398' and 'mcb869'. The PCR products amplified were resolved in
agarose gel by electrophoresis and visualized under UV light. PCR products of the
expected size (472 bp) were obtained from all the animals, confirming the universal nature
of our primers. These results are shown in Figure 4.


Example 5:

Example to establish the identity of confiscated remains of unknown animal origin
using the universal primers 'mcb398' and 'mcb869'.

The step-wise procedure to establish the identity of biological material from an
unknown animal source is given below:
Biological material of unknown animal origin
↓
DNA isolation
↓
PCR amplification of the isolated DNA using primers 'mcb398' and 'mcb869'
↓
Sequencing of both strands in triplicate (using any standard sequencing procedure,
such as the ABI Prism 3700, PE-Applied Biosystems)
↓
BLAST of the revealed sequence against the mito database of NCBI
(http://www.ncbi.nlm.nih.gov/BLAST)
(this indicates the family of the analyzed material by producing the most significant
alignment of the query sequence with the sequences registered in the database)
↓
BLAST of the revealed sequence against the nr database of NCBI
(http://www.ncbi.nlm.nih.gov/BLAST)
(this indicates the genus or, more precisely, the species of the analyzed material by
producing the most significant alignment of the query sequence with the sequences
registered in the database)
↓
Selection of reference animals belonging to the family, genus and species revealed by the mito
and nr BLAST searches
↓
Isolation of DNA from the blood of known reference animals;
PCR amplification using primers 'mcb398' and 'mcb869'; sequencing of the PCR
products in triplicate using the same primers
↓
Multiple sequence alignment of the revealed sequences of the mitochondrial cytochrome b
gene of known reference animals and of the biological material of unknown animal origin
using software such as Autoassembler/CLUSTAL X (1.8)
↓
Identification of the sequence, from the aligned sequences,
that is homologous (or significantly similar) to the cytochrome b gene sequence of the
DNA obtained from the biological material of
unknown animal origin
↓
The species of the homologous sequence would be the species of the biological material under
investigation
↓
Application of the above information for the objectives mentioned in columns 7-13 under
sub-heading 'Objective of invention' of heading 'Summary of invention'

Example 6:

The actual execution of the technique invented

As a first application, and to demonstrate the ease and utility of this method, we
investigated a case of forensic identification submitted to our laboratory to seek a scientific
opinion on animal hunting evidence. In this case, we received the half-burned remains of
an unknown animal, confiscated by the crime investigation agencies. The DNA was
isolated from the above material following standard methods (74) and subjected to PCR
amplification using the primers mentioned above (viz., 'mcb398' and 'mcb869').
Amplification reactions were carried out in a 20 µl reaction volume containing 20 ng of
template DNA, 100 µM each of dNTPs, 1.25 pmole of each primer, 1.5 mM MgCl2, 0.5 unit
of AmpliTaq Gold (Perkin-Elmer-Cetus, USA) DNA polymerase and 1X PCR buffer
(10 mM Tris-HCl, pH 8.3, and 50 mM KCl). The amplification profile followed was: an
initial denaturation at 95°C for 10 min, followed by 35 cycles each of denaturation at 95°C
for 45 s, annealing at 51°C for 1 min, and extension at 72°C for 2 min. The extension step
of the 35th cycle was held for 10 min.

The PCR products obtained were sequenced on an automated workstation (ABI Prism 3700,
PE-Biosystems) on both strands in triplicate, and the sequence resolved (328 bp, shown in
Figure 1a) was blasted against the mito database of NCBI using the BLAST program (73). The most
significant alignment (bits value 365, E value e-101) of this sequence was produced with
the cytochrome b gene sequence of Felis catus (Table 3), indicating that the species of the
analyzed material belongs to the family Felidae. Further, the above sequence revealed from the
confiscated remains was blasted against the nr database of NCBI using the BLAST program. The
most significant alignment (bits value 603, E value e-170) of this sequence was produced
with the cytochrome b gene sequence of Panthera pardus (Table 4), indicating the identity
of the analyzed material as that of a Panthera pardus source. Based on this information,
we selected the reference animals listed in Table 5, representing different species and
subspecies of Felidae. The DNA isolated from the reference animals was amplified and
sequenced on both strands in triplicate using the primer pair mentioned above. The consensus
sequences obtained were aligned using the program CLUSTAL X (1.8) (Table 6). Sequence
comparisons identified 113 variable sites in total amongst all the animals analyzed (Table 7).
Pair-wise comparisons of sequences were performed to find the variation among the
different animals investigated. All the species investigated were differentiated by their
unique nucleotide sequences. The molecular signatures of the different reference animals
were compared with the molecular signature of the confiscated skin 'adil.flesh'. Table 7
demonstrates the maximum similarity of adil.flesh with 'gz1L', i.e. a known leopard
(Panthera pardus), indicating the identity of adil.flesh, the confiscated skin, as
that of a Panthera pardus origin. We also calculated the similarity matrix showing the
pair-wise similarity amongst the animal species under investigation using PHYLIP software.
This matrix is shown in Table 8. It demonstrates that animals belonging to different
species had more variation, whereas animals of the same species had maximum similarity
among their cytochrome b sequences. The cytochrome b gene sequence of the DNA isolated
from the confiscated material had maximum similarity with the sequences obtained from
known leopard sources (99.7% and 98.2% with 'gz1L' and 'gz2L', respectively), establishing
the identity of the source of the confiscated material as that of a normal leopard (Panthera
pardus). The step-wise procedure involved in the above analysis is illustrated in Figures
1a, 1b and 1c, respectively.
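
The comparison step described above, in which the sequence from the unknown material is matched against aligned reference sequences, can be sketched in a few lines. This is a simplified illustration using plain percent identity over aligned, equal-length sequences; it is not the PHYLIP similarity calculation used for Table 8. The function names, reference labels and toy sequences are hypothetical and bear no relation to the data of Tables 5 to 8.

```python
def percent_identity(a, b):
    """Percent of identical bases over aligned positions, skipping gaps."""
    if len(a) != len(b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(a.upper(), b.upper()):
        if x == "-" or y == "-":
            continue
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * matches / compared


def variable_sites(aligned):
    """Indices of alignment columns containing more than one character."""
    return [i for i, column in enumerate(zip(*aligned))
            if len({base.upper() for base in column}) > 1]


def closest_reference(query, references):
    """Return the best-matching reference name and all identity scores."""
    scores = {name: percent_identity(query, seq)
              for name, seq in references.items()}
    best = max(scores, key=scores.get)
    return best, scores


# Hypothetical aligned sequences, for illustration only.
references = {
    "reference_A": "ACGTACGTACGTACGTACGT",
    "reference_B": "ACGTTCGTACGAACGTACCT",
    "reference_C": "TCGTACGGACGTACTTACGA",
}
query = "ACGTACGTACGTACGTACGA"

best_match, identity_scores = closest_reference(query, references)
sites = variable_sites([query] + list(references.values()))
```

The reference with the highest identity is taken as the candidate species, which is then confirmed against known animals as described in the text.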

Thus, the primers invented by us can generate the molecular signature from any biological
material of unknown animal origin, which is characteristic of its family, genus
and, more precisely, its species. When these signatures are compared in-silico with the
signatures already available in public databases (viz., GenBank, NCBI database, etc.) using
BLAST software (73), the comparison indicates the identity of the family, genus or species of the analyzed
material, which in turn is confirmed practically by comparison with reference animals of
the revealed family, genus or species, by including them in further analysis with the
primers 'mcb398' and 'mcb869'. The information revealed could be applied in
fulfilling the requirements of the objectives mentioned in columns 7-13 under sub-heading
'Objective of invention' of heading 'Summary of invention'.

The method of the invention can be used to establish the identity of confiscated animal
parts and products, which is one of the key requirements of wildlife identification in forensics. It
is needed to link the crime with the criminal beyond a reasonable doubt and so to prevent the
human violation of wildlife resources. Various morphological, biochemical and molecular
approaches have been proposed for this purpose; however, none of the current methods is
universally applicable to identify mutilated animal remains of unknown origin. We have
identified a fragment of the mitochondrial cytochrome b gene that carries enormous
information to differentiate among various animal species back to their family, genus and
species sources. We have also found that this fragment is flanked by sequences that are
highly conserved amongst a vast range of animal species. We invented a pair of universal primers
that can amplify this fragment from DNA isolated from biological material of an
unknown animal origin in the polymerase chain reaction (PCR) to reveal its identity at the species
and sub-species level. This novel invention has great potential to revolutionize the
whole scenario of wildlife forensic identification and crime investigation.
Table 1. The animal species included in the study for in-silico analysis

SN.  Code         Name                            NCBI accession #        *P,S/F    *P,S/R
1    aep.mel      Aepyceros melampus              AF036289 1              97, oU    94 6^ 7**,
2    ore.ore      Oreotragus oreotragus           AF0362SS'               OO <*7    Q4 €0
3    add.nas      Addax nasomaculatus             AF034722 :              97, 60    yj, oo
4    ory.dam      Oryx dammah                     AJ2226S5 1              90, 58    Vj, oo
5    hip.equ      Hippotragus equinus             AF022060 J              98, 63    o < <^
6    alc.bus      Alcelaphus buselaphus           AJ222681 1              97, 60    OO ^ Q
7    sig.lic      Sigmoceros lichtensteinii       AF034967 4              97, 60    no ^ o 98, 63
8    bea.hun      Beatragus hunteri               AF034968 4              97, 60    94, 62
9    dam.lun      Damaliscus lunatus              .AF0I6635 3             97, 60    77, 55
10   con.tau      Connochaetes taurinus           AF016633 3              82, 56    93, 62
11   bis.bon      Bison bonasus                   Y15005 5                90, 53    87, 63
12   bos.gru      Bos grunniens*                  AF091631*               90, 58    94, 62
13   bos.tra      Bos tragocamelus*               AJ222679'               90, 53    95, 66
14   bub.bub      Bubalus bubalis*                D34637 7                97, 60    93, 64
15   bub.min      Bubalus mindorensis*            D82395*                 97, 60    87, 62
16   tra.ang      Tragelaphus angasii             AF09I633 6              97, 60    87, 63
17   tra.eur      Tragelaphus eurycerus           AF036276'               90, 58    97, 64
18   nem.cau      Nemorhaedus caudatus*           UI7361'                 95, 61    93, 59
19   pse.nay      Pseudois nayaur                 AF034732 2              89, 55    89, 59
20   amm.ler      Ammotragus lervia               AF034731 2              94, 58    97, 63
21   cap.fal      Capra falconeri*                D84202 10               98, 63    95, 66
22   cap.ibe      Capra ibex*                     AF034735 2              98, 63    89, 53
23   hem.jem      Hemitragus jemlahicus*          AF034733 2              95, 61    90, 61
24   rup.pyr      Rupicapra pyrenaica             AF034726 2              95, 61    89, 59
25   rup.rup      Rupicapra rupicapra             AF034725 2              95, 61    94, 64
26   pan.hod      Pantholops hodgsoni             AF034724-               98, 63    95, 66
27   bud.tax.tax  Budorcas taxicolor taxicolor*   U17S63*                 90, 53    95, 66
28   ovi.amm      Ovis ammon*                     AF0147* ? 7 Z           98, 63    97, 64
29   ovi.vig      Ovis vignei*                                            93, 63    97, 64
30   cap.cri      Capricornis crispus*            AJ JUH JVJ£             98, 63    94, 63
31   ovi.mos      Ovibos moschatus                f ft Ul ' oo^           98, 63    92, 61
32   ore.ame      Oreamnos americanus                                     93, 63    94, 62
33   cep.dor      Cephalophus dorsalis            AF09I634*               97, 58    90, 61
34   cep.max      Cephalophus maxwellii           AF096629 13             97, 60    33, 53
35   alc.alc      Alces alces                     AJ000026' 4             95, 61    93, 59
36   hyd.ine      Hydropotes inermis              AJ000023' 4             97, 60    90, 63
37   mun.mun      Muntiacus muntjak*              AF042713' 1             90, 53    93, 64
38   cer.ele.kan  Cervus elaphus kansuensis*      AB02109S'*              93, 63    32, 59
39   cer.ele.xan  Cervus elaphus xanthopygus*     AB021097 16             93, 63    82, 59
40   cer.ele.can  Cervus elaphus canadensis*      AB021096"               93, 63    90, 61
41   cer.nip.ce   Cervus nippon centralis         AB021094 1 *            93, 63    90, 61
42   cer.nip.ye   Cervus nippon yesoensis         AB021095**              93, 63    90, 61
43   cer.nip.ke   Cervus nippon keramae           AB02I09I"               93, 63    90, 61
44   cer.nip.pu   Cervus nippon pulchellus        AB02JQ90 1 *            98, 63    90, 61
45   cer.nip.ni   Cervus nippon nippon            AB02I093'*              93, 63    90, 61
46   cer.ela.sc   Cervus elaphus scoticus         A802I099'*              98, 63    90, 61
47   cer.dam      Cervus dama                     AJ000022"               93, 63    88, 53
48   ran.tar      Rangifer tarandus               AJ000029' 4             93, 63    89, 57
49   mos.fus      Moschus fuscus*                 AF0268S3' 7             90, 59    90, 61
50   mos.leu      Moschus leucogaster*            AF0268S9' 7             90, 59    90, 61
51   mos.chr      Moschus chrysogaster*           AF026837' 7             90, 59    90, 61
52   mos.ber      Moschus berezovskii*            AF026886' 7             90, 59    90, 61
53   mos.mos      Moschus moschiferus*            AF026833' 7             90, 59    92, 61
54   kob.ell      Kobus ellipsiprymnus            AF022059 1              91, 61    95, 66
55   kob.meg      Kobus megaceros                 AJ222686'               91, 61    83, 56
56   red.aru      Redunca arundinum               AF09662S' 3             91, 61    94, 62
57   red.ful      Redunca fulvorufula             AF036234*               89, 57    94, 62
58   neo.mos      Neotragus moschatus             AJ222683'               89, 57    94, 62
59   pel.cap      Pelea capreolus                 AF022055 1              91, 61    90, 61
60   ant.cer      Antilope cervicapra*            AF02205S J              82, 56    93, 64
61   sai.tat      Saiga tatarica                  AF064487'*              91, 61    92, 61
62   gaz.dam      Gazella dama                    AF025954 1              91, 61    92, 61
63   our.our      Ourebia ourebi                  AF036288 1              82, 56    82, 59
64   gaz.gaz                                      AJ222682 1              91, 61    89, 57
65   rap.mel                                      AF022053 5              81, 54    80, 50
66   mad.kir      Madoqua kirkii                  AF022070 3              90, 58    97, 65
67   ant.ame                                      AF091629*               98, 63    93, 68
68   tra.jav      Tragulus javanicus*             D32139"                 86, 57    86, 59
69   tra.nap      Tragulus napu                   X56288 :o               81, 52    93, 53
70   bal.acu      Balaenoptera acutorostrata      X75753 Zi               89, 56    97, 61
71   bal.bon      Balaenoptera bonaerensis        X75581 21               89, 56    93, 59
72   bal.bor      Balaenoptera borealis*          X7553 - * 11            89, 56    93, 59
73   bal.ede      Balaenoptera edeni              X75533 :i               89, 56    83, 54
74   esc.rob      Eschrichtius robustus*          Y75535 11               97, 61    86, 57
75   bal.mus      Balaenoptera musculus*                                  97, 57    93, 59
76   meg.nov      Megaptera novaeangliae*         Y755JU' 1 A. / J JO**   97, 61    94, 63
77   bal.phy      Balaenoptera physalus*          Mr* nnn'Ji^             97, 57    94, 63
78   cap.mar      Caperea marginata               A. / jjoO               93, 55    91, 53
79   cep.com      Cephalorhynchus commersonii     ArU6**UO                85, 51    33, 55
80   cep.eut      Cephalorhynchus eutropia*       Ai 7 0o4U / 2           85, 51    92, 59
81   lag.obl      Lagenorhynchus obliquidens      AF0840O7                Q4 59     92, 59
82   cep.hea      Cephalorhynchus heavisidii      AF08407Q                O/,       97, 63
83   cep.hec      Cephalorhynchus hectori*        AFQ84Q71
84   lag.aus      Lagenorhynchus australis        AF0S4069                OO. J**   92, 59
85   lag.cru      Lagenorhynchus cruciger         AF084Q68                          92, 59
86   lag.obs      Lagenorhynchus obscurus         AFOS--066 : *           36, 54    92, 59
87   lis.bor      Lissodelphis borealis           AF0S4Q64 :4             85, 51    92, 59
88   lis.per      Lissodelphis peronii            AFO5^065 : *            86, 54    92, 59
89   glo.mac      Globicephala macrorhynchus      AF0S40S5 :4             94, 59    S3. 55
90   glo.mel      Globicephala melas              AF084Q56 : *            94, 59    S3. 55
91   fer.att      Feresa attenuata*               AF084052 :4             94, 59    92, 59
92   pep.ele      Peponocephala electra*
93   gra.gri      Grampus griseus
94   pse.cra      Pseudorca crassidens*
95                Lagenorhynchus acutus
96   orci.orc     Orcinus orca
97   orca.bre     Orcaella brevirostris
98   del.cap      Delphinus capensis
99   del.tro      Delphinus tropicalis
100  del.del      Delphinus delphis
101  sten.cly     Stenella clymene
102  sten.coe     Stenella coeruleoalba
103  turs.adu     Tursiops aduncus
104  sten.fro     Stenella frontalis
105  sous.chi     Sousa chinensis
106  sten.lon     Stenella longirostris
107  turs.tru     Tursiops truncatus
108  lage.alb     Lagenorhynchus albirostris
109  sten.bre     Steno bredanensis
110  sota.flu     Sotalia fluviatilis
111  del.leu      Delphinapterus leucas
112  mono.mon     Monodon monoceros
113  plat.gan     Platanista gangetica
114  plat.min     Platanista minor
115  kogi.bre     Kogia breviceps
116  kogi.sim     Kogia simus
117  phys.cat     Physeter catodon
118  lipo.vex     Lipotes vexillifer*
119  phoc.sin     Phocoena sinus
120  bera.bai     Berardius bairdii
121  ziph.cav     Ziphius cavirostris
122  meso.eur     Mesoplodon europaeus
123  meso.bid     Mesoplodon bidens
124  meso.den     Mesoplodon densirostris
125  hype.amp     Hyperoodon ampullatus*
126  meso.per     Mesoplodon peruvianus
127  pont.bla     Pontoporia blainvillei
128  hipp.amp     Hippopotamus amphibius
129  hex.lib      Hexaprotodon liberiensis
130  rhin.son     Rhinoceros sondaicus*
131  cera         Ceratotherium simum
132  dic.sum      Dicerorhinus sumatrensis
133  equu         Equus asinus
134  baby.bab     Babyrousa babyrussa
135  phac.afr     Phacochoerus africanus
136  sus.scr.ew   Sus scrofa haplotype £M'BS*
137  sus.bar      Sus barbatus
138  lama.gla     Lama glama
139               Lama guanicoe



*4 

AF0S4053* 


9"*, J" 


OO, J-> 


AF0S4059-' 


97, 01 


ftQ *\0 


AFQS4Q57- 4 


94, 59 




AF0S4075 14 


98, 63 


ay, J7 


AF08406i :4 


86, 57 




AF084Q63 :4 


36, 57 


91 , 54 


AF084037 :4 


96, 54 


97, 63 


AF0840SS :4 


97, 57 


97, 63 


AF0840S5 : * 


97. 57 


97, 63 


AF034083 :4 


97, 57 


97,63 


AF084082 :4 


97. 57 


97, 66 


AF084092 : * 


97. 57 


97, 63 


AF084090 :4 


97, 57 


97, 63 


AF0840S0- 4 


97, 57 


83,59 


AF084103 :4 


97,61 


97, 63 


AF084095 :4 


97. 57 


96, 59 


AF084074' 4 


97,61 


97, 66 


AF08407 1 7- 4 


97,61 


94, 64 


AP304067 :5 


97,61 


97, 63 


U72037 26 


97,61 


95, 66 


U72038 : * 


97,61 


95. 66 


AF304070 25 


97,61 


86, 59 


X92543 17 


97,61 


86, 59 


U72040 25 


97, 59 


90, 63 


AF304072 :s 


96, 55 


92, 63 


AF304073 :i 


97. 57 


80, 53 


AJ304071 23 


89, 56 


83, 53 


AF08405i :4 


87, 49 


92, 62 


X9254P 7 


96, 55 


90, 59 


X92540 :7 


97.61 


. 89, 57 


X92537 :7 


97, 57 


90,61 


X9253S 17 


97,61 


92,61 


X92536 :7 


91,61 


94,63 


X92539" 7 


97,61 


90.65 




97,61 


36, 53 




92, 59 


83. 55 


vn<is i 

I voo \ j 


92, 53 


95. 66 


vnoc i J-* 
YUoo i •* 


93. 63 


97. 66 




90. 59 


37. 61 




90. 59 


90,63 


a r-74S7 -, 3 , ° 


90. 59 


36. 57 


NC_00i7SS Jt 


91.61 


73. 51 


Z50106" 


39, 56 


1 35.56 


Z50090" 


90. 59 


37. 54 


AF1365J9' 4 


97. 57 


33. 54 


Z50107 n 


97. 57 


35. 55 


U06429 ,f 


39. 55 


35. 53 


Y0S8i: ; "' 


33, 54 


36. 57 
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140 vie. vie 

141 cam.bac
142 arc.for
143 arc.gaz
144 eum.jub
145 zal.cal
146 odo.ros
147 pho.vit
148 pho.fascia
149 pho.gro
150 cys.cri
151 hyd.lep
152 lep.wed
153 mir.leo
154 eri.bar
155 mon.sch
156 hela.mal
157 sel.thi
158 ail.ful
159 fel
160 can
161 tal
162 gla.sab
163 gla.vol
164 hyl.pha
165 pet.set
166 bel.pea
167 pte.mom
168 gala.demi
169 pero.pot
170 gala.mat
171 gala.moh
172 oto.gar
173 lor.tar
174 nyc.cou
175 mus
176 gorr
177 homo
178 dug.dug
179 ele.max
180 afr.con
181 pavo.mut
182 tra.bly
183 tra.sat
184 tra.cab
185 tra.tem
186 arg.arg
187 cat.wal



Vicugna vicugna 
Camelus bactrianus
Arctocephalus forsteri
Arctocephalus gazella
Eumetopias jubatus
Zalophus californianus
Odobenus rosmarus
Phoca vitulina
Phoca fasciata
Phoca groenlandica
Cystophora cristata
Hydrurga leptonyx
Leptonychotes weddelli
Mirounga leonina
Erignathus barbatus
Monachus schauinslandi
Helarctos malayanus m
Selenarctos thibetanus*
Ailurus fulgens
Felis catus
Canis familiaris
Talpa europaea
Glaucomys sabrinus
Glaucomys volans
Hylopetes phayrei*
Petinomys setosus m
Belomys pearsonii m
Pteromys momonga m
Galagoides demidoff
Perodicticus potto
Galago matschiei
Galago moholi
Otolemur garnettii
Loris tardigradus*
Nycticebus coucang*
Mus musculus
Gorilla gorilla
Homo sapiens sapiens
Dugong dugong*
Elephas maximus*
Afropavo congensis
Pavo muticus m
Tragopan blythii*
Tragopan satyra*
Tragopan caboti
Tragopan temminckii m
Argusianus argus
Catreus wallichi*



UUvH JO 


89.55 


85,53 




94. 53 


86. 53 


Ab2Zyl 


97, 60 


87, 64 


Vt? *7*7Q"7^* 

XSLlyL 


94.53 


87.64 


X3^3 1 1 


97. 57 


36, 57 


XS231Q 


39, 55 


86. 57 


XS2299 


91. 61 


31. 52 


X32306 


90, 53 


87, 64 


XS2j02 


93 63 


95, 66 


XS2303 3 * 


y » t * 


90, 61 


XS2294 3 * 


07, Jv 


87, 64 


XS22 97 36 




S2 54 


X72005 37 


Off fil 
yo, Oj 


y i , 


XS2298 36 


BO 


82 59 


XS2295 36 


QO 

o?, jo 


87 63 


X72209 37 


O t At 

y I, 01 


87 60 


U1SS99 :I * 




9Q 63 


AB020910 3 


89, j * 


R7 64 


X94919 40 


9J, j3 


87 64 


NCJ301700 4 ' 




00 63 


NCJ)0200S 4Z 


93, 53 




Nc]]00239i 43 


81, 50 


J / 


AF01I738' M - 


90, 59 


32, J** 


AB030261 45 


9Q, 59 


8 /, 00 


AB030259 45 


91.61 


81, 50 


AB03 0 2 60 45 


91,61 


81, 30 


AB030262 45 


91,61 


87, 64 


AB030263 45 


97,61 


90, 6j 


AF2714I1 46 


97, 53 


87, 64 


AF271413* 


97, 60 


87, 63 


AF2714Q9 4 * 


97. 60 


90. 6t 


AF271410 4 * 


97, 57 


95. 66 


AF271412 M 


92, 53 


87, 60 


U5353l 4T 


97.60 


93. 59 


U535SG 47 


97, 60 


95. 66 


NC_OOI569 4S 


97.60 


36. 59 


NCJ3G1645 4 * 


89,57 


30. 53 


NCJ)0iS07 so 


96, 55 


84,64 


U07564 5 ' 


97. 60 


39. 59 


AB002412 s: 


97.60 


76. 57 


AF01376Q" 


97.53 


37. 63 


AF013763" 


97, 57 


37.63 


AF200722 54 


39. 55 


35. 57 


AF229S37 i4 


39. 55 


36, 6I 


af:oo723 ,j 


39. 55 


S6. 6i 


AF023S02" 


39.55 


3 I. 56 


AF0I376I ,) 


39. 55 


S7, 63 


..\ro:379: n 


33. 54 


35. 57 
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183 


cro.cro  Crossoptilon crossoptilon*
189  sym.ree  Syrmaticus reevesi*
190  bam.tho  Bambusicola thoracica m
191  fra.fra  Francolinus francolinus
192  ith.cru  Ithaginis cruentus*
193  ant.par  Anthropoides paradisea
194  ant.vir  Anthropoides virgo
195  gru.ant.an  Grus antigone antigone
196  gru.ant.gi  Grus antigone gillae
197  gru.ant.sh  Grus antigone sharpei
198  gru.leu  Grus leucogeranus*
199  gru.can.pr  Grus canadensis pratensis
200  gru.can.ro  Grus canadensis rowani
201  gru.can.ta  Grus canadensis tabida
202  gru.can.ca  Grus canadensis canadensis
203  Grus americana
204  Grus grus
205  gru.mon  Grus monacha*
206  Grus nigricollis*
207  Grus japonensis
208  cic.boy  Ciconia boyciana*
209  rhe.ame  Rhea americana
210  ant.alb  Anthracoceros albirostris*
211  fal.fem  Falco femoralis
212  fal.ver  Falco vespertinus
213  fal.per  Falco peregrinus
214  tauspa
215  ayt.ame  Aythya americana
216  smi.sha  Smithornis sharpei
217  vid.cha  Vidua chalybeata
218  chry.pic  Chrysemys picta
219  emy.orb.ku  Emys orbicularis
220  che.myd  Chelonia mydas m
221  eum.egr  Eumeces egregius



AF02S794 53 


89. 5j 


ft* ^7 


AF02S30I" 


89. 55 


ft j, j i 


AF028790 51 


80. 43 




AF013762" 


97. 58 


50, 0 1 


AF063193 53 


93. 63 


ft J, J I 


U27557 56 


85. 56 


ft-, Jo 


U2754S 56 


84. 54 


82. 52 


Ul!060 i7 


90. 53 


87, 6j 


UU064 37 


90, 53 


87, 63 


UII061 37 


90, 53 


87, 63 


U27549 56 


90, 53 


87, 63 


U27553 56 


97. 60 


37, 63 


U27552 56 


97, 60 


87,63 


U2755I 5 * 


98, 63 


" 87,63 


U27554 56 


97,61 


87,63 


U27555 5 * 


90. 53 


87, 63 


U27546 56 


89, 54 


87, 63 


U27543* 5 


90, 53 


37, 63 


U27S47* 


90, 58 


87, 63 


U27550 5 * 


81,54 


87, 63 


NC_002i96 58 


94, 53 


79, 60 


AF090339 59 


93. 63 


79, 60 


U39190 60 


97,61 


86, 59 


U833I0*' 


97,61 


36, 60 


US3311 61 


97,61 


85, 57 


1133307* 


97,61 


84, 52 


U83306* 


92. 59 


30,51 


KCJ00877 6 - 


93, 63 


94, 62 


NCJ)00S79 59 


97. 53 


90,61 


NC_0003SQ 59 


97, 60 


87, 64 


NC_002073 43 


39, 56 


86, 57 


AJ13U25 64 


90. 59 


94, 63 


ABO 1 2104*' 


90. 53 


94,63 


ABO 1 66Q6* S 


36. 55 


73.51 



WO 02/077278 



28 



PCT/IN01/00055 



Table 2. Multiple sequence alignment of 472 bp fragment of mitochondrial cytochrome b gene of 221 animal species
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. ere 
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. nas 




cry 


.dam 
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. bus 




5 


. lie 
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ce n 
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a mm 
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- " a Y 




cap 
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• _ _ 
nem 


an 




cap 


. :ai 




ru? 


.pyr 




ru? 


.ru? 




r.em 


. cau 




bud 


. tax. 


-a;c 


pan 


. bed 
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. a mm 




ovi 


.vie 




cap 


.cri 




cvi 


. mcs 




ere 


. ame 




ce? 


.dor 
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. max 




bis 


.ben 




bos 


. gru 




bes 


. era 




bub 


.min 




tuba . bub 


era 


. ang 




era 


. eur 




keb 


. ell 




fcob 


. meg 




red 


. aru 




red 


. f ui 






. mcs 




pel 


- cap 




ga = 


. dam 




cu r 


. ou r 




ar. 3 


. ce r 




sal 


z a _ 




mad 


. 




rap 


mel 




g.ac 






a ri- 


a rr. *e 




ft yd 


Lr.e 




mur. 


m'-in 




i L c 


i L c 




ce r 


el i 





TACCATGAGGACAAATATCA I T~ 



TCCCATGAGCACAAATATCATTCTCAGGAGCAACAGTCATTACAAA 
TTCCGTGAGCAGAAATATCA' 



TGCCA~GAGGACAAATATC^VrTCTGAGGAGCAACAGTCATCACCAACC - TCTCTCAGCAA i 3 

TAC^ATGACGACAAACATCATTTTCAGGGGCAACAGTTATCAGTAA 60 

-ACCA-GACGACAA-nTATCATTCTGAGGAGCAACAGTCATCACCA^CCT-C . C . w\uwu* b 0 

~GCCA^GAGGACAAATATCATTCTGAGGGGCAACAGTCATCACCAATCTCCTCTCAGO^A. =0 

TGGCATGAGGACAAATATCATTCTGAGGGGGAACAGTCATCACCAATCTCCTCTCAGC.-urt , 0 

tgcc^tg2ggac.xaatatcattctg^ J l 

TACC^TGAGGACAAAT^T^CT " 

TGGCATGAGGACAGATATCATTCTGAGGGGCAACAGTCATC^CCAACCTr. . ^ft^ - 0 

TGCCATGAGGACAAATATCA^^ J J 

ta^caSaIS^ 

TACCATGAGGACAAATATCATT 

KcStgagg^^ " 
t^Stgag^^^^ 

TACCATGAGGACAAATAT CATT CTG AGGAGC AACAGC AATTA.C ^"^^ ~ ^AGCAA \\ 
TiCCATO AGGACAAATATCATTCTGAGGAGGAACAGTTAT-ACCAACC :CwUi o a 



TACCATGAGGACAAATAT CAT 
TACCATGAGGACAAATATCC 



TACCATGAGGACAAATAT CCT 



TACCATGACCACAAATATCCT 



TCCCATCCCCACAAATATCCT 
TCCCATGGGGACAAATACCCT 



TACCATGAGGACAAATATCCT 



TACCATCACCACAAATACCTT 



TACCATGAGGACAAATATCC 
TACCATCAGGACAAATATCTT 



taccatcaggacaaatatctt 
tgccatgaggacaaatatcct 

TACCATGCCCACAAATACCCC 
TACCATCACCACAAATACC*: : 
TACCATCACCACAAATATCAT 

ttccatcacca taaatatcat 
taccatcacgacaaatatcac 

TACCATCACCACAGACACCA 7 
7ACCATCACCACAAATA 7 CAT 



"CT 



CTGAGGAGCGACAGTCATCACTAATCTCCTTTCAGCAA € 0 



"CTGAGGAGCAACAGTTATCACTAATCTTCTCTCAGCAA 5 0 
CTGAGGACCAACAGTTATCACTAACCTTCTCTCACCAA * 0 
TCACCAATCTACTATCACCAA 60 
TCACCAACCTTCTCTCAGCAA 5 0 
CTCAGCAA €0 
CTCAGCAA -5 0 
CAGCAA o > 
CAGCAA 50 



CTGAGGACCAACAGT' 

CTG ACGGG C AA C ACTT AT CA CT AAC CTC 




C . CA3CACCAA .Aw 

CTCAGCCCCAACA 3TCATTA tTAACCTA'- 
CTCAGGAGCAA ZCCTCATTACTAA" "CC 
CATCA2TAACC 

■a :taac 



"CACCCC 7AA ZACTCAT' 
CTC ACC AG T AACAGT 7 A CT A " 



CTCAGCAA « 
CAGCAA M 
TCAGCAA -50 
CTCAGCAA SO 
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TACCATGAGGACAAATATCATTCTGAGGAGCAACCCTCATTACCAACCTTCTCTCAGCAA 6 0 
TACCATGAGGACAAATATCATTCTGAGGAGCAACAGTCATTACCAACCTTCTCTCAGCAA 6 0 
TACCATGAGGACAAATATCATTCTGAGGAGCAACAGTCATTACCAACCTCCTCTCAGCAA 6 0 
TACCATGAGGACAAATATCATTCTGAGGAGCAACAGTCATTACCAACCTCCTCTCAGCAA 6 0 
TACCATGAGGACAAATATCATTCTGAGGAGCAACAGTCATTACCAACCTTCTCTCAGCAA 6 0 
TACCATGAGGACAAATATCATTCTGAGGAGCAACAGTCATTACCAACCTTCTCTCAGCAA 6 0 
TACCATGAGGACAAATATCATTCTGAGGAGCAACAGTCATTACCAACCTCCTCTCAGCAA € 0 
TACCATGAGGACAAATATCATTCTGAGGAGCAACAGTCATCACCAACCTTCTCTCAGCAA 6 0 
TACCATGAGGACAAATATCATTCTGAGGAGCAACAGTTATTACC^ATCTTCTCTCAGCAA 6 0 
TACCATGAGGACAAATATGA7TCTGAGGAGCAACAGTTATCACAAACCTCCTCTCAGCAA 6 0 
TACCTTGAGGACAAATATCTTTCTGAGGAGCGACAGTTATTACCAATCTTCTCTCAGCAA 6 0 
TACCTTGAGGACAAATATCTTTCTGAGGAGCAACAGTTATTACCAATCTTCTCTCAGCAA 6 0 
TACCTTGAGGACAAATATCTTTCTGAGGAGCAACAGTTATTACCAATCTTCTCTCAGCAA 6 0 
TACCTTGAGGACAAATATCTTTCTGAGGAGCAACAGTTATTACGAATCTTCTCTCAGCAA 6 0 
TACCTTGAGGACAAATATCTTTCTGAGGAGCAACAGTCATCACTAACCTTCTCTCAGCAA 6 0 
TACCCTGAGGACAGATATCTTTCTGAGGAGCCACAGTCArCACCAACCTCTTATCAGCTA 6 0 
TACCCTGAGGGCAAATATCTTTTTGAGGAGCTACAGTCATCACTAACCTTCTTTCAGCAA 6 0 
TACCCTGAGGACAAATATCATTTTGAGGTGCAACCGTCATCACCAACCTCCTATCAGCAA 6 0 
TACCCTGAGGACAAATATCATTTTGAGGCGCAACCGTCATCACCAACCTCCTATCAGCAA 6 0 
TACCCTGAGGACAAATATCATTTTGAGGCGCAACCGTCATCACCAACCTCTTATCAGCAA 6 0 
TACCCTGAGGACAAATATCA.TTTTGAGGCGCAACCGTCA.TCACCAACCTCTTATCAGCAA 6 0 
TACCCTGAGGACAAATATCATTCTGAGGCGCAACCGTTATCACCAACCTCCTATCAGCAA 6 0 
TGCCCTGAGGACAAATATCATTCTGAGGCGCAACCGTCATCACCAACCTCCTATCAGCAA 6 0 
TACCCTGAGGACAAATATCATTCTGAGGCGCAACCGTCATCACCAACCTTCTATCAGCAA € 0 
TGCCCTGAGGACAAATATCATTCTGAGGCGCAACTGTAATCACTAACCTCCTATCAGCAA 6 0 
TGCCCTGAGGACAGATATCATTCTGAGGCGCAACCGTCATCACCAACCTCCTATCAGCAA 6 0 
TACCCTGGGGACAGATATCATTTTGAGGTGCAACAGTCATCACCAACCTCCTATCAGCAA 6 0 
TACCCTGGGGACAGATATCATTTTGAGGTGCAACAGTCATCACCAACCTCCTATCAGCAA 6 0 
TACCCTGAGK3ACAGATATCATTCTGAGGTGCAACAGTCATCACCAACCTCCTATCAGCAA 6 0 
TACCCTGAGGACAAATATCATTTTGAGGCGCAACAGTCATCACCAACCTCCTATCAGCAA 6 0 
TACCCTGAGGAGAAATATCATTTTGAGGTGCAACAGTCATCACCAACCTCCTATCAGCAA 6 0 
TACCCTGAGGAC^GATATCATTTTGAGGTGCAACAGTCATCACCAACCTCCTATCAGCAA 6 0 
TACCCTGAGGACAGATA.TCATTTTGAGGTGCAACAGTCATCACCAACCTCCTATCAGCAA 6 0 
TACCCTGAGGACAGATATCATTTTGAGGTGCAACAGTCATCACCAACCTCCTATCAGCAA 6 0 
TACCCTGAGGGCAGATATCATTTTGAGGTGCAACCGTCATCACCAACCTCCTATCAGCAA 6 0 
TACCCTGAGGACAGATATCATTTTGAGGTGCAACCGTCATCACCAACCTCCTATCAGCAA 6 0 
TACCCTGAGGACAGATATCATTCTGAGGCGCAACCGTCATCACCAATCTCCTATCAGCAA 6 0 
TACCCTGAGGACAGATATCATTCTGAGGCGCAACCGTCATCACCAATCTCCTATCAGCAA 6 0 
TACCCTGAGGACAGATATCATTCTGAGGCGCAACCGTCATCACCAATCTCCTATCAGCAA £ 0 
TACCCTGAGGACAGATATCATTCTGAGGCGCAACCGTCATCACCAATCTCCTATCAGCAA 6 0 
TACCCTGAGGACAAATATCATTCTGAGGCGCAACCGTCATCACCAATCTCCTATCAGCAA 6 0 
TACCCTGAGGACAGATATCATTCTGAGGCGCAACCGTCATCACCAATCTTCTATCAGCAA 6 0 
TACCATGAGGACAAATATCATTCTGAGGCGCAACCGTTATCACCAATCTCCTATCAGCAA 6 0 
TACCCTGAGGACAGATATCTTTCTGAGGCGCAACCGTCATTACTAATCTCCTATCAGCAA 6 0 
TACCCTGAGGACAGATATCCTTCTGAGGTGCAACCGTCATCACCAATCTCCTATCAGCAA € 0 
TGCCCTGGGGACAAATATCATTCTGAGGCGCAACCGTCATCACCAACCTCTTATCAGCAA 6 0 
TGCCCTGAGGACAAATATCATTCTGAGGCGCAACCGTCATCACCAACCTCTTATCAGCAA 6 0 
TGCCCTGAGGACAAATATCATTCTGAGGCGCAACCGTCATCACCAACCTCTTATCAGCAA 6 0 
TGCCCTGAGGACAAATATCATTCTGAGGCGCA=vCCGTCATCACCAACCTCCTATCAGCAA 5 0 
TGCCCTGAGGACAAATATCATTCTGAGGCGCAACCGTCATCACCAACCTCTTATCAGCAA 6 0 
TGCCCTGAGGACAAATATCATTCTGAGGCGCAACCGTCATCACCAACCTCTTATCAGCAA 6 0 
TGCCCTGACCACAAATATCATTCTCAGGCGCAACCGTCATCACCAACCTCTTATCAGCAA 6 0 
TGCCCTGAGGACAAATATCATTCTGACGCGCAACCGTTATCACCAACCTCCTATCAGCAA 6 0 
TACCCTGAGGACAAATATCATTCTGAGGTGCAACCGTCATCACCAACCTCCTATCAGCAA 6 0 
TGCCCTGAGGACAAATATCATTCTGAGGCGCAACCGTCATCACCAACCTCTTATCAGCAA 6 0 
TACCCTGAGGACAAATATCATTCTGAGGCGCAACCGTCATCACTAATCTCCTATCAGCAA 6 0 
TACCCTGAGGACAAATATCATTCTGAGGTGCAACC37CATTACCAACCTCCTGTCACCAA 6 0 
TACCCTGAGCACAAATATCATTCTGAGGCGCAACCGTCATTACCAATCTCCTATCAGCAA 6 9 
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TACCCTGAGG ACAAATATCArTCTCAGCCaCAACCGTCArTACCAATCTCCTATCAGCAA S 0 
TACCCTGAGGACAAATATCATTCTGAGGTGCAACCGTCATCACCAACCTCCTATCAGCAA 6 0 
TACCCTGAGGACAAATATCATTCTGAGGTGCAACCGTCATCACCAAC CTTTT ATCAGCAA 6 0 

TACCCTGAGGACAAATATCATTCrGAGGTGCAACCGTCATCACCAACL'I I I TATCACCAA o 0 
TACCCTGAGGCCAAATATCATTCTGAGGAGCAACCGTCATCACCAACCTTATATCCGCAA b 3 
TGCCCTGAGGCCAAATATCATrCTGAGGAGCAACCGTCATCACAAACCTTATATCCGCAA 5 0 
TGCCCTGAGGACAAATATCA— CTGAGCCGCAACCGTTATCACAAACCTTCTATCAGCAA 6* 0 
TACCCTGAGGACAAATATGATTTTGAGGCGCAACCGTCArCACTAATCTTCT A - CAGCAA b 0 
TGCCCTGGCGACAAATATCATTTTGAGGTGCTACCGTCATCACAAACCTCTTArCAGCnA 6 3 
TGCCTTCAGGGCAAATATCA'rTCTGAGGTGCAACCGTCATCACCAACCTCCTArCCGCTA 6 0 
TACCTTGAGGACAAATATCATTCTGAGGTGCAACCGTCATCACAAACCTCTTATCCGCTA 6 0 
TTCCCTGAGGACAAATATCATTCTGAGGCGCAACCGTTATTACCAACCTCCTATCCGCCA 6 0 
TACCCTGAGGACAAATATCATTCTGAGGCGCAACTGTTATTACTAACCTCCTATCCGCTA 6 0 
TACCATGAGGACAAATATCCTTCTGAGGTGCAACTGTCATTACCAATCTTCTATCCGCTA 6 0 
TACCCTGAGGACAAATATCATTCTGAGGCGCAACCGTCATCACCAAT CTCCT ATCCGCCA 6 0 
TACCTTGAGGACAAATATCATTCTGAGGCGCAACTGTCATTACTAATCTTTTATCTGCTA 6 0 
TACCCTGAGGACAAATGTCATTCTGAGGTGCCACTGTCATCACTAACCTCCTATCAGCGA 6 0 
TAC CATGAGGACAAATAT CATT CTGAGGGGCAACAGTC ATCAC CAACTTACTATCAGCTA o 0 
TG C CATGAGGACAAATGTCATTCTGAGGGG CAACAGTCATTACCAACTTACTGTCAGCTA S 0 
TAC CATGAGGTCAAATATCCTTCTG AGGAGCCACAGTTATCACAAATCTCCTCTCAGC CA 6 0 
TAC CATGAGGTCAAATATCCTT CTGAGGGGCTACAGTCATT ACAAATCT C CTCTCAGC CA 6 0 
TAC CATGAGGC CAAATATCCTTCTGAGGGGCTACAGTCAT CACAAAC CTC CTCTCAGCTA S 0 
TACCATGAGGACAAATATCCTTCTGAGGAGCAACGGTCATTACAAA,CCTCCTATCAGCAA . 50 
TACCTTGAGGACAAATATCATTTTGAGGAGCTACCGTCATTACAAACCTACTATCAGCCA 6 0 

TACCCTGAGGACAAATATCGTTCTGAGGAGCCACAGTCATCACAAAC S 0 

TGCCCTGAGGACAAATATCATTCTGAGGAGCTACGGTCATCACAAATCTACTATCAGCTA 5 0 
TGCC CTGAGGACAAATATCATT CTGAGGAGCTACGGT CAT CACAAATCTACTATCAGCTA 5 0 
. TCCCATGAGGACAAATATCATTTTGAGGGGCAACAGTAATTACAAATCTACTCTCGGCAA SO 
T C CCATGAGGC CAAATATCATTTTG AGGGGCAACAGTAATTACAAAC CTACTCTCGGCAA 6 0 
TCCCATGAGGACAAATATCATTTTGAGGGGCAAaAGTAATTACAAA * 0 

T C C CATG AGGACAGATATCATTCTGGGG AG CAACAGTAATT AC CAAC CTACT CT CAG CAA $ 0 
TTCCATGAGGACAAATATCATTCTGAGGAGCG ACCGTCATTACCAACCTC CTAT CAG CAG 6 0 
TTCCATGAGGACAGATATCATTCTGAGGAGCAACCGTCATTACCAACCTCCTGTCAGCAA S 0 
TTCCGTGAGGACAAATATCATTCTGAGGAGCAACCGTCATTACCAACCTCCTATCAGCTA S 0 
TTCC ATG AGG ACAAATATCATTTTG AGGAGCAAC CGTCATTAC CAACCTC CTATCAGCAG 60 
TAC CATGAGGACAAATAT C CTT CTGAGGAG CAAC CGTCATCAC CAAC CTTCTGTCAGCAA 6 0 
TACCATGAGG ACAAATATCATTCTGAGG AGCAACAGTCATCACTAAT CTACT AT CAG CAA 6 0 
TAC CATG AGGGCAAATGTCATTCTG AGG AG CAACAGTTAT CACT AAT CTACTAT CAGCAA S 0 
TAC CATGAGGACAAATAT CATTTTG AGG AGCAACAGTC ATCAC C AACCT ACTAT CAG CAA ^ 0 
TACCGTGAGGACAAATATCATTTTGAGGAGCGACAGTCATCACCAACCTACTATCAGCAA 6 0 
TGCCATGAGGACAAATATCATTTTGAGGAGCA^CCGTTATTACCAACTTACTATCAGCAA 6 0 
T ACCATG AGGACAAAT AT CATT CTG AGG AG C AACCGTC ATT ACC AACTTACTATCAGCAA 6" 0 
TGCCATGAGGACAAATATCATTTTGAGGAGCAACCGTCATTACCAACCTACTATCAGCAG S 0 
T ACCATG AGGGCAAAT AT C ATTTTG AGG AG C.AACCGTT ATCAC CAAC CTACT AT CAGCAA 5 0 
T ACCATG AGG ACAAATACCCTT CTG AGGGGCG AC CCTCATCACCAACC . ACTATCAGCAA 6 0 
TACCCTCAGCCCAAATCTCCTTCTCAGCAGCAACTCTCATTACCAATCTC- -ATCACCCA o 0 
TACCCTGAGGCCAAATATCCTTCTGAGGACCGACTGTCATTACCAACCTCCTATCAGCCA 6 0 
TGCCCTGAGGACAGATATCATTCTGAGGACC.AACCGTTATCACCAACCTACTATCAGCGA SO 
TACCATCAGCCCAAATGTCCTTCTCACCAGCAACCGTAATCACTAACCC^CTG .CAGCAA 50 

TACCATGACGACAAATATC ATTTTG ACC ACCAACTCTAATCACTAATC . , CTCTCTGCCA o 0 
TACCATGGGGTCAAATATCCTTTTCACGTGCAACGGTAATTACAAATTTACTGTCAG^A ^ Q 

TACCCTGAGGACAAATATCTTTCTCAGCAGCCA "CTCATCAC CAAC CTTCTGTCAGCAA y 

TACCCTGACGACAAATAT-rCTTCTCACC ACCTA JTGTCATCACCAACCTTCTC -^'j^* \ ^ 

TACCATCACCACAAA' 

TACCATCACCACAAA 

TACCATCACCACAAA 

TACGCTC AGGACAAAT A . . . ,^ 

TTCCATGAGGCCAAATATTATTCTGAOGTGCTA 7 CGT A A 7 TACT A A C TTCC . CTCA^C vA 
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TCCCATGACCACAAATATCA— CT _ 

TCCCATGAGGACAAATATCATTCTGACGCGCTACCGTAATCACAAATCTCC^C * 
TTCCGTGAGGACAAATATCArTCTGACGCGCTACCGTAATCACTAACCTCCTCTC^ov-^ 
TCCCATGAGGACAAArGrCATTCTGAGGCGCAACCGTAATTACAAATCTCCTCTCAGCAA 
TCCCATGAGGACAAATATCA— CTGAGGAGCCACAGTAATTACCAACCTACTATCAGCAA 
TCCCATGACGACAAATATCATT CTGAGGTGCCACCGTCATCACTAACCTACTATCG^CAA 
TTCCATGAGGACAAATA7CA— CTGAGGrGCCACAGTTATTACAAACCTCCTATCAwCCA 
TCCCATGAGGCCAAA7A7C CTTCTG AGG AGCCACAGTAATCACAAACTTGC7 ATCCGC w-i 
T^CCGTCAGGCCAAArArCATTCTGACGGGCCACAGTAATTAGAAAC I rACTATCCGCCA 
TCCCATGAGGACAAATArCATTCTGAGGAGCAACCGTTATTACTAACCTCCTGTCAG^.A 
TTCCATGAGGAGAAATATCATTCTGACGGGCAACCGTAATTACTAACCTCTTCTCAGCAA 
TCCCATGAGGCCAAATATCATTCTGAGGGGCAACTGTCATCACAAACGTATACTCAGCAA 
TCCCATGAGGTCAAArGTCATTCTGAGGGGCAACTGTTATCACAAATCTA-L .^.CAGCAA 
TCCCATGAGGACAAATATCATTTTGAGGGGCTACCGTCATCACAAACTTATTCTCAGCAA 
TCCCATGAGGACAAATArCArTTTGAGGGGCTACCGTCATTACAAATTTATTCTCAGCAA 
TCCCATGAGGACAAATATCACTTTGAGGAGCTACCGTCATCACAAATTTATTrTCAGCAA 
TCCCATGAGGACAAATArCATTTTGAGGGGCTACCGTCATCACAAATTTAi l CTCAGCAA 
TCCCATGAGGACAAATATCLArTTTGAGGAGCTACCGTCATCACAAACCTATTCTCAGCAA 

TTCCATGGGGACAAATATCArTTTGAGGGGCTACTGTCATCACAAA 
TCCCATGAGGACAAATATCATTTTGAGGGGGTACCGTCATCACAAATCTAT I CTCAGCAA 
TCCCATGAGGACAAATAXCATTTTGAGGGGCAACCGTCATCACA 



ltLLAi^HU<JV.UWAlrt.UA. _w*w~ww.-^ — ~rv*nj~/-n * 

tao»3ga«»caaa»tc^^ 

TAC^TGGGGACAAATGTCn-TTTGAGC^ 

TACCA^GAGGACAAATATCArrrrGAGGGGCTAC\GTCATCACC^ATCrC E rC^»v.w 

TACCAttsRGGA(^»rcArrrrea^^ 

TACCATGAGGACA^TA™GATTiTGAGGGGCTACAGTCATCACCAATCTCj.^CT^G^w j 
TGCCATGAGGACAAATATCATTCTGAGGGGCTACAGTCATTACCAACCTC i. * CiCAGC.^ 
TGCCATGAGGACAAATAT^rrCTGAGC^GCTAC^GTCATTACCAACCTCTTC^GCC^ 
TACCATGAGGACAAATATCArrCTGAGGGGCTACAGTCATTACCAACCTCT I C I C^G^w- 
TACCATGGGGACRAATATCAT?CTGAGGGGCTAC«GTCATTACCAACCTCTTCTCMCCrf 

TACCATG AGGACAAATATCA7TTTG AGGGGCTAC^GTTATCACCAA . 
TACCATGGGGACSAATGTCATITTGAGGGGCTACAGTTATCACCAATCTC^T^^s^^CCj 

TACCAraWGACJVAATATCATTT^^ 

TACCATGAGGACAAATATCATTTTGAGGGGCTACAGTTATCACCAAC.- . - i 
TACCATGGGGACAAATATCCrrrTGAGGGGCTACAGTrATCACC.AArCrC^^ r -— - 
TGCCATGAGGACAGATATCArrCTGAGGGGCTACAGTCArCACCAAC^^^j-^U.A 
TACCATGAGGAC^AATATCATTCTGAGGAGCTACACTTA—ACCAAC^.A^^C^v-^ 
TACCATGAGGGCAAATATCATTCTGAGGCGCCACCCTCATCACCAACCT^C^^G^A 
TACCCTGAGGACAAATATCArrCTGAGGGGCTACAGTTATCACCAACCTA : ^--G^ 
TACCCTGAGGACAAATATCATTCT'""" --"^ '* ir.-r- i - i "A ACC 



TACCCTGAGGACAAATATCATTCTGAGCAGCCACAGTCA : i ACCAAw- ^ 
TACCCTGAGGACAAATGTCATTCTGAGGAGCCACAGTCArrACGAAC- r 2^-^^^ 
TACCATGAGGACAAATATCATTTTGACGGGCCACCGTCATCACTAA 
TCCCATGAGGCCAAATATCATTCTGAGCTCCTACAGTAATCACCAA 
TGCCATCACGACAAATATCATTCTSAGCACCCACAGTAA - CA C 
TACCATGGGGCCAAATATCCTTCTGAGGTSCeACCG . - ATT ACT AA 
TACCATGACGCCAAATATCTTTCTGAGGTGCCACC 



TACCATCAGCACAAATAT-TAT - . -j^w — ■ - — ^ " * C.~AA 

TCCCATCGGGACACATATCCTTCT'j ACCCCCA A TCGTAATTA "AAA - - 
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TCCCATACATCCCCACAAATCTAGTCGAATGAATTTGAGGGGGATTCTCCGTACACAAAG 
TCCCATATATTGCCACAAACCTAGTCGAATGAATCTGAGGGGGATTCTCCGTAGAC-aAAG 
TCCCATATArrGGCACAGACCTAGTAGAATGAATCTGAGGGGGATTCTCAGTAGACA^G 
TCCCATATATTGGCACAGACCTAGTAGAATGAATCTGAGGAGGATTATCAGTACAC^vG 
TTCCATATATTGGTACAAACCTAGTCGAATGAATCTCACGAGGGTTCTCAGTAGACWC 
T^CC^-ACA7CCGCAC5^ATCTAGTCGAATGGArCTGACCGGGCTTCTCAGTAGACW,G 
TCCCATACATTGGCACTAACCTAGTCGAArGAATCTGAGGGGGATTCTCAGTAGACAAAG 
TCCCArACATTGGCACAGACCTGGTCGAATGAATCTGAGGGGGATTCTCAGrAGACAAAG 
TCCCCTATATTGGCACAA-ArCTAGTCGAArGGATCTGAGGGGGATTCTCAGTAGACAAGG 
TCCCATATArTGGCACAAACGTAGTCGAATGAATCTGAGGGGGATTCTCAGTAGACAAAG 
TTCCATATATCGGCACAAACGTAGTCGAATGAATCTGAGGAGGATTCTCAGTAGACAAAG 
TCCCATATA^GGCACAAACCTAGTCGAATGAATC^GAGGAGGATTCTCAGTAGATAAAG 
TCCCATACA-rGGCA'TAGACTTAGTCGAGTGAATCTGAGGGGGCTTCTCCGTAGACAAAG 
TCCCG^ATA^GGCACAGACrTAGTCGAATGAArCTGAGGAGGCTTCTCGGTAGACAAGG 
TCCCATATATTGGCACAAACCTAGTCGAATGAATCTGAGGGGGAXT-CTCAGTAGACAAAG 
T^CCA"\CA^GGCACAAACCTAGTTGAGTGAATCTGAGGAGGATTCTCAGTAGACAAAG 
T r~ C ^ V 7 " ^C^^GGCACA.GAC CTAGTCGAATGAATCTGAGGGGG ATTCTCAGTAGACAAAG 
TTCCA^ATATTGGCACAAACCTAGTCGAATGAATCTGAGGGGGATTCTCAGTAGACAAAG 
TTC CATATA7TGGCACAAAC CTAGTCGAATGAATCTGAGG AGG ACTCTCAGT AG ACAAAG 
TCCCATATATTGGCACAAACTTAGTAGAATGAATCTGAGGAGGATTCTCCGTAGACAAAG 
TCCCATACATCGGCACAAACCTAGTCGAATGAATCTGAGGAGGATTCTCCGTAGAC^AAG 
TTCCATACATCGGTACAGACCTAGTCGAATGAATCTGAGGGGGG TTCT CAGTAGACAAAG 

TC C CATA CATTGGTACSiAACTTAGTCGAATGAATC 

TCCCATATATCGGCACAAACTTAGTTGAGTGAATCTGAGGGGGCTrTTCAG.AG^C^G 
TCCCATACATCGCK»CA^^ 

TTCGATACATCGGCACAAATTTAGTCGAATGGA7TTGAGGTGGG77C7CAG * AUrtC-viAG 
TCCCATACATO^^ 

TCCCATACATTGGCACAAACCTAGTTGAGTGA^^ 

XCCCATACATTGGTACAAGTCTGGTTGAATGAATTTGAGGGGG ATTCTCAG i ^^5^"^? 

T CCCATATATTGGCAC CAAC CT AGTTG AATG AAT CTG AGG AGG E r 

TCCCTTATArTGGCACCA.GCCTAGTCGAATGAATCTGAGGGGGC— TC^G.AG^OnAAG 
TTC CATACATTGGCACAAAC CTAGTCGAATGAATCTG AGG AGG ATTTT ^^^C^Cr^aAC 
TCCCATATATCGGCACAAACCTAGTCG AATGAATCTGAGGAGGATTG ^^^^^EC"YY5 
TCCCATACATCGGCAC^CCTAGTCGAATGAATCTGACGAGGATTCTCAG.C^A.^G 

TCGCATACATCGGCACAAP.GCTAGTTGA^TGAATCTGAGGAGGP.TTCTCAG ww^^^AG 

TC C CATATATCGG CACAAACCTAGTCGAATGAATCTGAGGGGGTTT G . -^^A^AAG 
^CCCATACATTGGTACAAACCrrAGTCGAATGAATCTGAGGGGGATTTTCAG^^GACrtApG 

TCCCATACATCCWCACAGACCT^ 

TTCCATACATTGGTACAAACCTAGTCGAATGAATCTGAGGAGGGTTC ^^E^ 
TCCCATACATCGGTACAAACCTAGT AG AATG AATCTG AGG AGGGTTC t 
TCCCATATATCGGCACAGACCTAGTAGAATGAATCTGAGGGGGTTTTTC^C^AGA AA^G 
TC^ATATATCGCCACAAACTT AGTTG AATGAATCTGAGGGGGCTTCTCAG . nG**CrtAr\G 
TTCCCTACATTGCCAC.^\CCTAGTAGAATGGATCTGAGGAGGATTTTCAG t .CATAAAC 



TCCCATACATCGGCACA^ACCTAGTAGAATGAATCTGAGGGGGATTC^C^AG^^^ 

C CATACATTGGTACTAACCT ACT AG AATG AATCTG AGGGGGATTC - CAG 
•CCATACGTCCCTACAA-ATCTAGTCGAATGAATCTGAGGTGGCTTTTCAG^GA^^G 

-rr ATA~A-TGGCACAAACTTAGTCCAATC AATCTG AGO ACCCTTT . CAG^ . ^ - 

-TAGTTGAATGAATTTGAGGCGGTTTTTCAGTAGACAAAG 



:CATACATTCCTACTAA 
:CATACA 



TCAGTAGATAAAG 




CCATACATTCCCACAAACCTAC 
CCATATATTCGCACAAA 
CCA T AT ATTCC C AC AA AC CTACTCC AA TCO A T 
C CA TAC ATTCCCACAAACCTAGTCC AATCC A TC 
7CATACATTCCCACAAACCT* 
1 C AT AC ATTGG CAT AAAC CT AG" 
7CATATA7TCCCA CAAA TCTAO 

: ca r a< :a ttcgtaca a 



:tao 



"7TCACTAGATAAAC 
TTC A' "AC AT AAAC 



^ACOCCC-. - - - 
CAGGGGGCTTCTCAGTACATAAAG 

CACGAGGCTTTTCAGTACATAAAG 

ACATAAAG 

TAGTAGATAAAG 




7 C AC TAG AC AAAC 
7 ACT A 3 AC AAAC 
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TTCCATATATTCGTACAAATCTAGTCGAATGAATTTGACGACGpj^^^^^^^^^^ 
TTCCATACATTGGTACTAAT-CGTTCAATaAATTTGACGACG^-^A^Au^^ 

TTCCATACATTGGTAGTAAtG~GGrTGAATGAATTTGAGGACwC^O..A^A 
TTCCATACATTCGTACTAACrrGGTTGAATCAA^ 

TTCCTTACATTGGTACTAAT-GGT-CAATGAArCTGAGGAGGC^^^-AG.^ 
TTCCCTACATTGGTAGTAACCTGGTTGAGTGAATT^ 

TCCCATACATTCCCACAGAC77GG - C3 AATG AA7CTC Aww . ^^^Ir-- C - C 1A,\G 
TCCCCTA^TCGC^CC^ 

TCCCATATAiTGGTACTACCTTAG iC-ji-vrti-j/vn* - ^^w^^--- gaCAAAG 
TCCCATACATTGGTACCAGGTTAGTTGA.ArGA.^CTGAGGTG : ^^^ : - rr ^ 

TCCCATACATTGGrACTACCGTAGTGGAATGGATGTGACG^G^,-^---"?— G 
TCCCATACATTGGTACTACGCTACTCGAATGAATCTGGGGCGG^^C ^ AGA..^G 

TCCCATACATTGGGAGTACCCTAGTCGAATGGGTCTGAGGGGGTTT^^^ iG^AGrt^^^^ 

TCCCATACATTGGTAGTAGCC7AGrCGAATGA.^GTGGGGGGG^^C, : -AG c C^G 
TCCCATACATTGGTACCAGCC-AGTCGAATGAATGTGAGGCOT-^CTCT^G« ^G 
--rri-A^AT^GGTACCACCGTAG-rTGAArGA-\TCrGGGGTGGCTTCTCCGT.-.GAO^G 

tccg^acatcggtag^^^ 

TCCCCTA»TCCKr:ACTM^ 

TGCCCTACATCGG-AGTAGCTTAGTAGAAT^^ 
TCCCCTACATCGGCACTACCTTAGTAGAATGAATCTGAGGAGG^ 

TCCCCTACATCGGTACTAC^^ 
TCCCCTACATTTC-rACTACCTTAGTAG^^^ 

TCCCCTACATCGGTACTACC-TAGTAGAATGAATCTC-AC-GCGC^ 

TCCCCTACATCGGTAGTACCTTAGTAG^ 

TCCCrTACATCGGCACCACC~AGTAGAATGAA^ 

TCCCTTACMCCGCACTACC^^ 

tScttacatcggaaccaccttagtagaatgaat^^^ 

TCGCCTACATCGGTAGTACTrTAGTAGAATGAATGTGAGGTGG^ 

^CCCCTACATCGGTACCACTTrAGTAGAATGAATCTGAGGAGGA- - - -- Cj -'*£r™! 
TCCC^ACATCGGCACTACCGTAGTAGAATGAATCTGAGG 

TCCCTTACATCGGCACCACCTTAGTAGAATG^TGTGAGGTGGA — ^ACAAAG 
TCCCTTACATCGGGACTAGCCTAGTAGAATGAATCTGAGGTGGA^^^^^-'^^^^^A^G 

TGGCTTATATTGGCAC-AGC™AGTCG.AATG.AA-CrGAGG-GGA - ;::r ^C^^ 
TCCCTTATATTCGCACTACCTT^ 

TCCCTTATATTGCCACTACCTiAGTCGAATGAATCTGAGG "^^J^^~^X~^qa^"AAAG 
rCCGTrATATTGGCACTACCrTAGTGGAATGAATCrGAGG.GG ? ^-X^^aCAC AAa"g 

TGCCTTATATTGCCACTACCTTAGTCGAAT^ - - ' r -~> •• ■ — - ' 



\GGAATGTGAGGTGGAGGGTGGGTAGAGAAAG 

"AGTTGAATGAATCTGAGGGGGATTTTCGGTAGACAAAG 

, r. , . , a — GG A — TTCGGTACACAAAG 

"AG . . jA.^ . G.-^rt - - I j^U j . ->^/^ - - - - — _ 

i- , ---- — rAGC-GGA TTCCGTAGACAAA'-i 

TCCCTTATATCGGCACTACCT . «G . -jAA^-hw ^ — ------ 



rATATTGGCAC 
TGGCTTACATTGGCACTACG1 



TCCCTTATATTCGCACTACCC 



:gttatatgggt; 
:ttacatcgccactacc 



GCCTTACATCGGCACTACGT 



GTTACAGGGCTAAGAG 
GGCTTACATCGGGAAGACG 
GGCTTATATCGGGAGTAGGG 
rGGTTATATCGCGAGTACGG 

7GTTTACATCGGGACGACCG 
rGGCTATATCGGCAZrAGG- 
TGGTTACATG "CAAGTATT - 
TGGTTACATGjGCAGGA "GG 




-GTAGAATGAATG 
AGTAGAATGAATCT 
AC7ACAATGAATGTGA. 
AGTACAA"AATGTGAGG7GGG' 



GGTGGG' 




•GGTCGGGAGACAAAG 
~~C GGTAG AGAAAG 
Z~- GGGAGAGAAAG 
GTGCGGACACAAAG 
GTGTGTAGATAAAC 
•AGAGAAAG 



•GGG' 



J J - 




;CTACATAAAG 
AGA2AAAC 



CATAAGC 



jA'j'v ^G*-> . - _ 

;:t"at7TT-"aga-aaa. 
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TTCCTTATATCGCCACCACTCTTG"; 



TTCTCCGTACATAAAG 

TCCCCTATATCGGCACTACTCTAGTCGAATGAATCTGACGTCGTTTTTCAGTAGATAAAG 
TCCCCTATATTGCC ACTACTCTACTCG AATG AATCTGACCTCGCTTTTCCCTAG A . AAAG 
TTCCCTACATCCCCACTACCCTAGTTCAATCAATCTGAGC^ 

—ccSaS^^ 

TCCCTTATATTGGCACGACCGTAG7TGAATGAATTTGAGGTGGCTTCTCCGTAGAT.AAAG 
TCCGCTACATCGGA.^CTACCCTTG7AGAArGGA7CTGAGGTGGTTTCT^GTAGAC^G 
r— < — r\ /- ?. T^m ^ a r i r: ~ r*T i L ~ A G A ATG AAT CTG AGG AG G C : i t i CTG AC/\*i-vArtG 



• - GCCTATATTGGAACAGACCTAGTAGAA.TGAATCTGAGGAGGCTTTTCCGTAGACAAAG 
TCCCATACATCGGCACCGACCTTGTAGAATGAATCTGAGGGGGATTCTCCGTAGACAAAG 
TGCCCTATATCGGTACCAACCTTGTAGAGTGAATCTGAGGAGGATTCTCAGTCGACAAAG 
TCCCTTACATCGGCACCAACCTCGTAGAATGAATCTGAGGAGGATTTTCCG , .G AC AAAG 
TGCCCTACATCGGTACTACGCTCGTCGAATGAATCTGAGGTGGATTCTCAGTAGACAAAG 
TT C CCTATATCGG AACGG ACCTCGTAG AATGG ATCTG AGGP.GG CTTCTCCGT CGATAAAG 

T C C C CT AT AT C GG AACAG AC CT C G T AGAATG AA.T CT C- AGG GG G CTTTT C C GT C G ACAAAG 
TCCGTTATATCGGAACAGACCTCGTAGAA.TGAATCTGAGGGGGCTTTTCCGTCGACAAAG 

T~ C C AT AT GTT GG CACAACACT AGTCG AA.TG AATTTG AGG AGG ATTCT C CGTA.G ACAAAG 

TTCCA?ATGTTGG»CAACACTAGTC^ 

TTCCATACGTTGGTACAACACTAGTCGAG TGGATTTGAGG AGGATT CTC CGTAGATAAAG 
TTCGGTATATCGGCAC^ACACTAGTAGAATGAATTrGAGGTGGCTTCTCCG^GACAAAG 
C CCCT ACA^GGGA.CCAAC CT AGT AGAATG AAT CTG AGGAGGA.CTTTCAG^GA_TAA-r-.G 
TCCCCTACATCGGAA>CTAACCTAGTAGAATGAATCTGAGGAGGATTTTCAG . ^TAAGO 
TCCCTTACATCGGAACCAACTTAGTAGAJVTGAATTTGAG 
^CC^ACA^CGGAACCAACCTAGTAGAATGAAT^^ 

-TCCCTATGTAGGGACTGACTTGGTCGAATGAGTCTGAGGGGGGTTTTCAG^.GATAAAG 

m*^c " A^ A* 7 * CGGAAC CGACCTAGT ACAATGAATCTG AjGG AGG ATTTTCAG^^GATA^-.G 

m C C^'ACA' 7 * CGGAACGGAT CT AC-T ACAA.T GAATCTGAGGAGGGTT CTCAG i -GAT AAAG 

TCCCCTATGTCGGAACCGACCTTGTACAACGAATCTGAGGAGGGTTTTC^ 

T C C C CTACATCGG AGCCG ATCTAGTAG AA.TG AATCTG AGGGGG ATTTTCAGTCo^ 

T" 7 C CCT ACATCGGAACCG ACCTAGT ACAATGAATTTGAGG CGGATTTT w-lG , v-^CAAAw 

TTCCCTACATCSGA^ 

T^G^CGGAGACGACGTAGTACAATGAATCTG AGGAGGATTTT ,A^ ^ ^.-^JWG 
T CCC CTACATCGGGACTG ATCTAGTACAA.TGAATCTGAGGAGGATTCT ^^3T^^^Cq 
- ~CCTTACATCGC AACCSATCTAGTACAATGAATCTG AGGCGGGT . C - — G i .-C~ - ~AAU 
TCCCCTATATTGGAACGGACCTAGTAGAA.TGAGTCTGAGGAGGCTTTTCCGTAGAC^uw 

TTCCCTATATTGGAACTAAC^TGTA^ 

TTCCATACATCGGGACTGAACTAGTAGA.ATGAATCTGAGGGGGGTTCTC ?r ^G ? ^G 
TCCCTTATATCGGAACTGACTTAGTAGAATGGATCTGAGGCGGCTTC -C— .Cv^AAAu 



TT 



TT AC AT CGGT AC AG ACTT AGT AG AAT G AATTTG ACGTCGGTTCT CAGT AG ACAAAG 
TTCCTTATATTCGGACAACACTTGTAGAATGAATCTGAGGAGGC . . CT CTCTCG^O^Ao 
AACACTTGTAC AA" - -^.---r.r— TTCT C . . . . - . — ^ 
CAATCAAT 



:ttatat- 



GGTA 
t TTGGAACAGTCC 
CCCCTATATTCGAACACTC 



TCCCTTATATTGCAACTCATCTAGTA 
TCCCTTATATCGCCACCAACCTTGTTCAA' 
T T C C AT AT A TAG GG C CT ACT CT AGT AG AA 
T T CCATATCTACGTACAAC 
TTCCTTACATGGGTACCC; 
TTCCCTATATAGGAACT 




tga: 

agataagg 



CAAT 
CTCGT AG ^AT T AATT 
^CCTACTACAATCAATC 
-CCTACTACAATCAATC 
TCCCTACATAGGAACTAACCTAGTA 3 a jtcaa rc 
CCCTTACATCCCAACTAAOCTACTTCAA7CAATC 
T CC CT AT A TTOG CAT AAA C CT A 
CCTATATATTGGAACAACCCTAS .CGAATGAA 
CTCCTACATCGCAACAGACCTAwTCrAArTA^ 

ctca rACATTGGGACA'V--:cT-'; . . :aa r ;aa 



CCGTAGATAAGG 
CACAACG 
GAT .AAAG 
GT AG ACAAAG 
ACCGCCATT CTC AG TAG ACAAAG 
C AGG CCC ATTTTC ACT AG ACAAAG 
"CT CAGT AG ACAAAG 
~C CAOTAGACAAAC 
-CT TAG TAG ACAAAG 
TAG ACAAAG 



;acgc 




C A j- i . 

"a : ;a 



—tagtacacaaag 
:a :tcaotagataccc 

"V« rTCA-TA-ATACTC 
dug.dug
ele.max
afr.con
pavo.mut
tra.tem
arg.arg
cat.wal
cro.cro
sym.ree
bam.tho
fra.fra
ith.cru
ant.par
ant.vir
cic.boy
rhe.ame
rCTCACTACACAAAC
7ACATAAAC 




TCCCCTACATCCCCACCAACCTAGTCCAATCACTTTCAGCCCGAT 
TTCCCTACATCGGCACAAACCTAGTAGAATGAAriTGAGGACGC 
TCCCCTATATTGGT C AAAC C CT AGTAG AATGCC CCTG AGG AGG A - 

TCCC7TATATTCCACAAACCCTACTAGAATGACCCTGAGCCGCATT( ^ _ ^ 

TCCCATACATTGCCCAAACCTT AGTAG AATGAGCCTG AGG AGCC * . i . CAG H V,M ^C?1^ 
TCCCATACATTGGTCAAACCCTAGTAGAATGAGCGTGAGGCGGCT^.CAG . iGACAA-C 
TCCCATACATTGGCGAAACTCT ACTAGAATCCCCCTCAGCCCCCT , , TCAw . . — * - 
TCCCATACATTGCCCAAACCCTAGTACAATCACCTTGAGGGCGC TTTT CAG . iC^w^A i - 
T^CC^ATATTGCACAAACCCTAGTAGAGTGAGCCTCAGCAGGATTTTCAG : CGACAACC 
TCCCTTACATCGGACAGACCCTAGTAGAATGAC-CCTGAGGAGGATTCTCAGTTGACAATC 
TCCCTTACATTGGACAAACCCTAGTCGAGTGAGCCTGAGGGGGATTCTCAGTTGACAACC 
TCCCCTACATCGGACAAACCCTAGTAGAGTGGGCCTGAGGAGGATTCTCAG - iGACAACC 
TTCCCTACATCG G AC AAAC C CTAGT AG AATG AGCCTGGGGGGG ATTCTCAGTAG ACAAC C 
^CCCTACATTGGACAAACCTTAGTAGAGTGAGCCTGAGGGGGATTCTCAGTAGATAACC 
TTCCCTACATCGGCCAAACTCTGGTAGAATGAGCTTGAGGAGGATTTTCAGTAGACAACC 
^CCCATATATCGGCCAAACCCTTGTAGAATGAGCTTGAGGGGG TTTCT CAGTAGACAATC 
T CCCATACATCGGC CAAACC CTT G TAG AA.TGAGCTTGAJ3GGGGTTTTTCAGT AGATAATC 
T C CCCTACATCGG C CAAAC C CTTGTAGAATGAGCTTG AGGGGGCTTCTCAGTAGACAA^- ^ 
TCC C CTACATCGGCCAAACCCTTGTAGAATGAGCTTGA.GGGGGCTTCTCAGTAGACAA * 
T C CCCTACGGCGG C C\AACCCTTGTAGAATGAGCTTGAGGGGGCTT CT CAGTAGACAAT C 

TC C CCT ACATCGGC CAAAC C CTTGTAGAATGAGCTTGAGGGGG ^ ^^^^^T^CCX Z 
TCCCATACATCGGCCAAACCCTCGTAGAATGGGCTTGAGGGGGCTTCT^-^G - 

TCCCATACATCGGC CAAAC C CTCGTAGAA.TGGGCTTGAGGGGG CTT CT C AGT AGACAA 
TCCCATACXTCCXTC^^ 

TCCCATACATCGGCCAAACCCTCGTAGAATGGGCTTGAGGGGGCTTw a. 
~CCGATACATCGGCCAAACCA.TCGTAGAATGAGCTTGAGGGGG CTTCT C - G .^«w^- 
~ CCCATACATCGGC CAAAC C CTCGTAGAATGAGCTTGAGGGGGCTTCTCA.GTAGA^Aa ~ 
^ CCCATACATCGGC CAAAC C CTCGTAG AATGAGCTTG A.GG A.GG CTTCT CAGTAGAw^^- w 
TCC CATACATCGGCCAAACCCTC3TAGAATGAGCTTGAGGAGG CTT CT CAGT AGaw--tv^- ^ 
TCC CATACATCGG C CAAAC CCTCGTAGAATGAG GTTG AGGGGGCTTCT CAGTAoa^^^C 
TCCC CTACATCGGC CAAAC C CT CGT A.GAATGGGCCTG AGGGGGCTTCT-- - ^^^^^*Z Z 
TCCCGTACATCGGACAAACCTTGGTAGAATGAGCTTGAGGGGGGTTTT — -.G ^C^C^ 
TCCCATACATCGGCCAAACCTTAGTAGAATC<3GCCTGAGGGGGATTCrrCCG.w-^ i: 

TCC CATAC ATCGGT CAAACC CTAGTCGAGTGGGCCTG AGG ~C-^V,Z Z 

TCCCATACATCGGCCAAACCCTAGTCGAATGGGCCTGAGGAGGA* . - - TAG ..^^-.^w^ 
TCCCATACATCGGCCAAACCCTAGTCGAATGAGCTTGAGGGGGATTTTCAGTA 
TCCCATATATCGGCCAAACCCTAGTCGAATGGGCCTGAGGAGGATTCT^GTAGrtC^u^ 

TCCCATACATCCGGCAAACCCTTGTAGAATGGGCCTGAC-CAGGA . *CT - M 
^^CCATACATCGG ACAAACCCTAGTAGAATGAGCTTGGGG AGG ATTTTCAsj . .^.G/^uA^v- - 
TTCCATACATTGGCCAAACCCTAGTAGAATGAGCCTGAGGAGGATTCTCAG I AGAC^CC 
TCCCATTCATTGGTAACACATTAGTACAATGAATCTGAGGTGG A ; . 

TCCCATACATTGGCAATACACTAGTGCAATGAATCTGAGGGGGA * - C . 3— - — ..-^v-- 



XATACATCCGCAACACACTAGTACAATGAATC 
7 C CAT AC ATTGGC A C C 



TC AGG AGCGTTTT CAGTAG ACAATG 
k TGAATTTGAGGGGGCTTTTCCGTAGACAACG 



120 
120 
120 
120 
120 
120 
120 
120 
120 
120 
120 
120 
120 
120 
120 
120 
120 
120 
120 
120 
120 
120 
120 
120 
120 
120 
120 
120 
120 
120 
120 
120 
120 
120 
120 
120 
120 
120 
120 
120 
120 
120 
120 
120



a-? 
c r-2 
add 
cry 
hip 
ale . 

i am . 
r 3 r. . 
.ir.m . 
?*-*■- ■ 



. me I 
. cr-i 
. r.as 
. dam 
. -cru 
.bus 

i ; — 

. hun 
. iua 
ta-i 
1 ~ r 
n.iy 



CAACCCTN'ACCCCATT 
CAACCCTTACCCCATTC 




:acccgatt 

CAACCCTTACCCCA7 
CAACCCTTACCCCA 
CAACCCTCACCCCA 
CCACCCTCACCCCA 
CCACTCTCACCCCAT



:ccac 



TTACACCCC- 
CATTCATCATTGCAGCCC- 
CTACCCTAACCCG ATTCTT CCCTTTCCAC * ^CA
CCACCCTCACCCCATTCTTCCCCTTCCACT u .ATCCTCC 
CTACCCTCACCCCATTCTTTCCCTTTCA^ 

CTACCCTCACCCGA^^-CC~CCACTTCATCCTCCCATTTATCA..GCA^ 



TCTTTCCCTT 
CTACTCTCACCCGATTCTT C GCCTT C CACTT 

7T T C CTTT CACTT CAT Z CTC C CAT 

C CACCGTGA.CC CGA TTCTT CGC CTTT CACTTT ATTTT^ ^ ^ 
CTAC CCTCACG CG ATTTTTC GGGTTTCACT ; .A. "3Z*^t~ 
C CACCGT C AC C CG ATT CTTT G C CTT CCA. . ^CATTCTCCCA. - 
CCACCCTCACCCGATTTTTTGCTTTTCACTTTATC 
CAT CACCCCACCCCTCC
T CACTT ATCATCGCAGCCC"



CTACCCTCACCCGATTTTTCGCCTTCCAC^CA, .C.C^--- - ^,_., r . ( ^ 

CAACTCTCACC^^ 

CAACCCTCACTCGATTTTTCGCCTTCCACTTTATCTTCCCATTTAi C^i s*^^-^^- 



CAACCCTTACCCGATTTTTCGCTTTCCACTTTATCCTC --- , — „ 

CAACC CTCAC CCG ATT CTT C G CTTTC 3^^^^*^q CC^^CG 

caaccctaacccgattcttcgctttccactttatcctcccat :ca,^. G — - " 

CAACCCTCACCCGATTCTTCGCATi. iCACTTCA' 
CAACCCTCACCCGA' 
CAACCCTAACCCGATTr:



:CACTTCATCCTCC 
rTTT ATT ATT ACAG C GCTGO



CAACCCTTACCCG ATT CTT CGC CTC CCACTTTATC CT C CCA CA^-^^^J^- ^ Z ^ 
CAACCCTCACTCGATTCTTCGCC^^ 

oaccctcagcc^^ 

^CAAC C CTCA C C CGATTTTTTG CTTTC CACTTT ATTCC C C CA - - ^^^^CT^Zl^ 
*CAAGACT CAC C CGACTCTTTGC CTT C CATTT CAT CTT 
CAACTCTAACCCGATTCTTTGCCTTCCACTTCATC 



CAACCCTTAC C GG ATTTTTCGCCTTCCACTTTA.T C CT 
CAACCCTCACCCGAT ~~ ' ~ 
C AAC C CT CA C C C G ATT C

caacactcacccga: 
caaccctcacccga' 

caaccctc 
caactctaacccga 
caaccctaacccga' 



CAACCCTAACCCGA 

caaccctaacccga 
caaccctaacccga' 
caaccctaacccga 
caaccctaacccga 
caaccctaacg 
caaccctaacccga 
CAACCC



TCTTTGCTTTT 
CTTCGCATT 



TT AT CG CAGCTCT C G 
TTGCAG C C CTAG 
CCTAG 

CATCCTCCCATTCATCATTGCAGCCCTCG 
TTTTCGCTTTCCACTTTAT - - - - - -

rCGCTTTCCACTTTATTCTCCCATTTATCATCG^.^^.^ 

TCATCGCAGCAC 
TTATCAT 

TCATC ATCA C AGC A 
TTATCAT CA 
TTATCATCA 
:tc cactttatt ct c cc at

TTCCACTTTA TTCTT CC 
CAAC ATT AACACG CTT
CAACACT AACACCC 



CAACACT AA C ACGCT 
CAACACT AACACGCTTC 
CAACACTAACACGCTTCT 
CAACACT AACACGT 
CAACACT AACACG C 
rCATTATCCTACCATTAC
7 CT AO C ACT AC 
7CTACCACTAG 
AC C ACT AC 

CTCCC CTTCATCATTATACCATT AC 
/TCATTACAGCATTAG 
'CATCATCCTAGCATTAC 

CCACACTAACTCCCT— ' - ~ ~ ~ ^ - ~ ~ — r :.;t ' :" i'CATTATT CTAC C G CT A J 

CAACACTAACACGC7 ^ ^ __ 

CAACACT^CACCCTTTTTCGCCTTCCACTTTATCCTCCCATTCA.^ ^ ^ _ 

CAACACTAACACGCTTCT TCCCTTTCCACTTTATCCTCCCATTCAT 
CAACACTAACACGCTTTTTCGCCTTC^ 

CAACACT AACACGCTTTTTCGCCTTTCAC rTTATCCTCCCA CAXCA. j. wAlAG^ . .Aa 
CAACACTAACACGCTTTTTCGCTTTCCACTTTATCCTCCCA. * CATCATCACAGCATTAG 
C^CACTAA^CGCTTTTTCGCTTTCCACTTCATCCTCCCATTCATCATCACAGCATTAG 




CAACACTAACACGCTT 
CAACACT AACACG CTTTTT C G CTTTC 
CAACACTAACACCC 
CAACACTAACACGTTv.u 
CAACACT AACACGTTTTTTCGCTTTC 
CAACACTAACACGTT 



CATTCATCATCACAGCACTAG 




rTTCGCTTTCCAC 



_ ACTAG 

^CC^CCCATTCATCATCACAACATTAG 
kTCATCACAGCATTAG 
rCACAGCATTGG 



CAACACTAACACGTTTTTTCGCTTTC CACTTCATC CT C C ' 
CAACACT AAC AC GCTTTTTCGCTTTCCACTTTAT C CT CC ( 

rz ACAC^ AAC ACGTTT CTTTGC CTTCCACTTTA TC CT C C CACTCA - ^OWiwti 

CAACATTAACACGCTTTTTCGCTTTCCACTTTATCCTCCCA r * w*- v^^C^—^-w 

C^CATTAACACGCTTTTTCGCTTTCCACTTTATCCTC 

- cC 1 - * 'CCTCCCGTT CATCATCACAGCATTAG 

"cCCGTTCGTCATCACAGCATTAG 

:cgtt 



CAACATTAACACGCTTTTTCGCTTTCCA 




CAACACTAACACGCTTTTTCGCTTTC 
CAACATTAA.CACCCTTTTT CG CTTTC 
C AACATT AACACG CTTTTT CGCTTT CCACTTT A 
CAACATTAACACGCT 



CAACACTAACACGATTC 
CCACATTAACACCCTTC 
CTACGCT AACACG C 
CAACACTCACACCCT 
CAACATTAACCC 
CAACACT AAC ACG CT 
CCACACTAACACCC 
CCACACT AACACG 0 
CT AC ACT AAC ACG C 
CC AC ATT AAC AC 
CC AC ATT AACACCC 
CCACATTAACCC 
CTACA "AAC A 
7AACACTAACC 
3CACCCTTACA 



^TCCCTT CATCATCACAGCATTAG 
_ C CATTCAT CATCACAG CATTAG 
TT CTTCC ATTCATCATCACAGCATTCG 

~ " ~ , "7 CATCACAG CATTA G 

--r—CCATTTATCATCACAGCATTAG 
CT CCCATT CATCATTAC AGCG CT AG 

_tccca: 

ct c c cttt c at c at cct aac a ct ag 

CTC C CTTTC ATCATCCT AAC ACT AG 
CATCATCCTAGCACTGG 
CTAGCACTAG 
TAT C A Z CCTAACACTAA 
TAGCACTAA 
ccacccttacacga:
ccaccctcacccgg: 



CCACACTTACACCA' 
CCACCCTTACCCGAT 
CAACCCTCACACCA 
CAACTCTCACACGAT 
CAACCCTTACACGAT 
CAACCCTCACACGAT 
C CACCCTTACACG AT 
CCACCCTTACCCGAT 
CCACCCTTAACCGAT 
CCACCCTCACACGAT 
CAACCCTAACACGAT 
CAAC CCTAAC ACG AT 
CAACCCTAACACGAT 
CAACCCTAACACGAT 
CAACCCTAACACGAT 
CAACCCTAACACGAT 
CAACCCTAACACGAT 
CAACCTTAACACGA.T 
CAACTCTAACACGGT 
CAACCCTAACACGAT 
CAACCCTAACACGAT 
CAACCCTAACACGAT 
CAACCCTAACACGAT 
CAACCCTAACACGAT 
CGACTCTAACACGAT 
CAACCCTAACACGAT 
CAACTCTAACTCGA.T 
CCACCCTAACACGA* 
CAACCCTAACACGAT 
CGACACTCACACGAT 
CTACCCTAACCCGAT 
CTACCTTAACCCGAT 
CTACCCTAACCCCA 
CTACCCTAACCCGAT 
CAACCCTAACACGAT 
CTAC CCTAAC ACG AT 
CTACCCTTACCCGA' 
CTACCCTAACACGAT 
CCACCCTTACTCGAT 
CT ACTCTT ACCCG A' 
CAACCC 
\TTCTTCCATTCGTTATCACAGCAu . AG 1



rTGCCTT CCACTTCATCCTTCCCTTTA «. i ATCCT 
rCCTCCCCTTTATTATCAT 
r ATT CT A C C CTTT AT CAT CA 

* ^TTTTACCTTTTATCATCGC 

rrCGTCATTACCGCC 



:tcgcct 
rrccccT 



TCCACTTTATCCTGCCATTCATCATTACCGC 




C™CGCC^TCACTTCATTCT^ 

™/ re rCGTAGTATCAGCACTAG 

TCCACTTTATTCTCCCCTTCGTAGCATCAGCACTAG 
rCTTTGCCTTCCACTTTATTCTC C C CTTCATAG CATCAG CACTAG 
PCCTCGCCCTCCACTTCGTTCTTCCATTCATGGCAi iAGCACTAA 
TTTCGCTTTCCACTTTATCCTACCATTTGTAGTATCAGCACTAG 
—CGCCTTCCACTTCATCTTACCATTCGTAGTATTAGCACTAG 
'CTTCGC CTT C CACTTCATC CTGCCATT CGTAGTATCAGC C CTAG 
TTTC GCCTTCCACTTCATC CTAC CATTCGTCGT ATCAGCACT AG 
"C^CGC CTTCCACTTTATCCTT C CCTT CGTAGTAT CAGCACTAG 



? C C A CTTT AT CTT AC CTTTTGTC ATTG C AG CTCTAG 

:taccttttgtcattgcagctctag 

rTTCATCATTGCAGCTCTAG 

:tag 



rTCGCCTTCCACTTTATC' 



CGCCTTTCACTTTATCTTA.CC 
TGCCTTCCACTTCATCCT 



,v; -^^ww, . i. w ~ m ^ 

•C~?CGCCTTCC\CTTrATCCCTCCCT~CGTAGTATGAGwA_i.^ 

."CTiCGCCCTCC^CTTTATCCTACCATiCSTAGCACTAGCACTAG 

"C^CGCCTTCCACTTTATCCTACCATTTGTAGTATTAGCA- - AG 

"'CTTCGCTTTCCXTTTTA'TTATACCCTTCATAGTATTAGCACTAG 

'C-TTGCCTTCCAC-rrTATCCTTCCGTrCATCATCTTGGCACTA^ 



CTTTGCTTTCCACTTTATC CTTCCGTTCATCATCCTAG aG 




"CATCATCGCAGCTCTAG 



•CTTTGCATTCCATTTCATCCTCCCT _ - 

^' — GCCATTTATTATTGCGGC-is- .AG 
~ - C CTTTT ATT ATTGCTGCC CTAG 
rCATTGCCGCTCTAG 



7ATTAT' 

rATCGTAGCAGCCCT" 




rCGCATTCCACTTTG* 
: CG C ATT C C A CTTT A' 




'OTAATTGCACGAAT ^ 
rAATTGCAGGAATTA 
rAATTGCAGGAATTA 
: AA TTGCAGG AATTA 

;taa"gta;gaatta 

:— TTT -A 7TT TA OGAA 7TA 
CAACTCTCACCCCA
CAACCCTCACCCCA 
CAACCCTCACCCCA 



ITAATTCCACGAATCA 



. v. CTTCGCCCCACA 



CAACTCTCACCCGATTCTTCCCCC 
CAACCCTCACCCGATTCT 
CAACCCTCACCCCA 




. ~CCCCTTCCTAATTGCACGAATTA 
rCTCCTACCCTTCGTAATCACAGCAATCA 
rCCTACTCCCCTTCCTAATCGCACCAATTA 
:CCTT CCTAATTCCAGGAA . CA 
-r-TTCGCAATCGCAGGAA-: - A 
CCACATTAACTCGATTCTTGACTTTACACTTCCTCCTTCCATTCATAA^A^^ ^ ^ ^ 
CCACATTAACTCGATT—rGACGTTACACTTGCTCGTTGCA^CAT^^ ^f^.-— ^ 

ccacattaactcgattcttcactttacacttggtcgttgca - . 

CCACATT AACT C G ATT CTT C ACTTT AC ACTT C CTC CTT C CATT CATAA . ^-^^ ^ 
CCACATTAACTCCATTCTT CACTT TACACTTGCTCGTrCCGTTCA^.^A^G^^^^ 
C CACATTAACTCG ATTCTT CACTTT ACACTTC CT CCTTC C ATTCAT AATC AT AGvj w 
C CATTAAC C C G ATT CTT C ACTTT AC ACTT C CT C CTC C CATT J^^^^^Z^ 
C CACATTAAC CCG ATT CTT CACTTT AC ACTT C CTC CT CCC ATTCATAATTATAuwCC^. CA 



CCA CATT AACCCGATTCTTCAC1 *~ - - - - - - - - _ _ „ , 

CCACA.TTAAGCCGATTCTTCACTTTACACTTCCTGCTGCCATTCATAATTA x AGcv- 

CCACATTAAC C CGATTCTT C A CTTTACACTTC CT CCT C C ' 

ccacattaacccga: 



:tctt 

ccacattaactcgattcttcacct 

C CACATTAACTCG ATTCT 
CGACATTAACTCC-ATTCT 



:ctca 



:tca 



:ttgctcctc'_ 

cj .-cattcataatcataggcctca 

CAACACTAAC CCGA.TTCTT C G G G C^ACACTTTTCT^C*rC C C ^ H^CZ^!^^^ 
CTACCCTAACCCGATTC: "" " 



:tcgccctgcacttccttctccccttcl 

CAAC C CTGACACGATTCTT C GC C CTACACTTT CTC CTC CCGTT 

caacactgacccgattcttcgccctacacttcctc.- _acca- - ^ZZ^7 

CAACACTAACCCG ATT CTTC GC CCTAC ACTTTCTC CTAC '^^^ " ^ ^ 
CAAC^CrGACGCGATTCTTCGCCCTACACTTCCTACj. Z CCA - - ^J^i^^^P^ 



CAACCCT AACTCG ACTCTTCGCCAC C CACTTCCTACTAC C J^JIlC— 
CCACCCTTACCCGATTCTTCTCCCTTCACTTCCTGCTCCCA. ; ^^-^^^ 
CAACACT CACG CGATTCTTCGCC CTACACTTC CTTCT AC C X^-^r ^"^^A 

CAACCTTAACCCGATTTTTTAC C CTT CACTT C CTT CT AC X^"^ 
CAAC C CTAAC CCG ATTCTTC A.CTTT C CACTTCTTACTGC CACTT ACC AT 
CAACCCTAACC CGATTCTT CA.CGTTC CACTTCCTAT . ACCATTTGCCA I 
CAACCCTCACCCGATT 
CCATAGTCCACCTACTC
CCATAGTACACCTACTCT 
C CATAGTCCAT CT ACT C 
CCATAGTCCACCTACTCT 
CCATAGTACACCTACTCT 
C CATAGTT C AC CT CTT AT 
CCATACTTCACCTCTTAT 
C C AT AGTCC ACCT CTT A 
C C ATAGTGC AC CT C 




CTA TACT CCA CCT A 
C CAT AGTCC ATTT A 
CCATAGT 
CTATACTTCACCTACT:
C C AT AGTG C AC CT ACT 
CTATAGTACATTT 




:tcctc 
~c 



^ _ _ _ ^ _ _ * » CCAT 

CCATAGTCCACTTACTTTTCCTCCACCAAACACGATCT^^ 
CCATAGTCCAOT 

CCATACTTCACCTACTATrCCTCCACGAAAC; 
CCATAGTCCACCTACTATTC: 

CAATAA?CCATCTACTCTTCCTCCATGAAACAGCGT : 
CAA~AG~C^C — ATTATTTCTC CACG AAACAGG AT' 

C^AGTCCACC-rATTATTrCTCCACGAAACAGGArCCXACAACCCAACAGGAAri. - -- 
^A ; GGTCC.^CCTArrATTCCTCCAtGAA.^CAGGATCCA.^CAACCCAACAGGA.^TCTCA. 
C^ATGGTACACGTACTATTCCTCCACGAAACAGGATCCAAGA^ 
CCAT^CCATCTTCTCra^ 

CTATACTTCACCTACT A. -■"*' • , - r '"" :,r 13 l " JVJA - " " 

CTATAGTACACCTACTAT' 




CCACAGTCCACCTACTATTCC^ 
CCA^AGTACACCTACJj^CTC^^^ 

CTATAGTTCACCTACn~TTCCTCC^ 

CTAXAGTCCACTTArrArTCCrTCATGAAACAC^ArCCAATAACCCCA o^*-*^- 

C CAT AGTGCACTT ACTTTCT CT C CAC G AAACAGG AT C C AAT AAC C C.-^. "^'^*^^ ^ ^ ~" ~ 
CTATAGTC^CCTACTTTTCCTCCACGAAACAG^ 

CCATAGTCCACTTACTTTTCCTCCACGAAJ^CAC-GATCCAACAACC^-" ^ "^--,^77^ 
CTATAGTACACTTACTCTTCCTTCACGAAACLAGGATCCAA-AACww^ * '^^^J,™, Z 

CTATAGTACACTTACiC.CC : .wACG^^C^w.- .^..-^^ rj , CG2J ,^ cn7r 
CTATAGTACACTTACTCTTCCTTCACGAGACAGGATCC^ACA^CCC^CAGv,^ . . 

CTATAGTACACTTACTCTTCCTTCACGAC-ACACGACC 

CTATAGTACACTTACTC 

CTATAGTACACTTACT' 



CTATAGTACACTTACTC 
CTATAGTACACTTACTC 
CTATAGTACATTTACTC 



CT AT AGT C C ATTTG CTT 



CTATCGTTCACCTACTC 
CTATCGTTCACC 
CTATGGTTC 
CTATCGTTCACCTACTC 
CCATCCTTCATCTAC 
TCCTAGTCCACCTTT 
7, i. » ^.ALwu-AL^Uun * ^.^^^^^

?r CTTCACG AG ACAGGACCC AACAACCCAACAGGAA . . 
TCCTTC-^CGAGACAGGATCCAACAA.CCCAACAGG^TCC^ 



XACG AG ACAGG AT C ' 



rCACGAAACAGGAC 
rCA CG AG ACAGG AT CC AA 



• AAT AAC C CAACAGG AACT' 



CCCA' 

_ CTTCACG AAACACCCTCTAACAATCCA 
7T CT C C ACG AAACAGG AT C C AAC AA C C CAACAGG AAT CAC A^ 

~ 7 Vac aac c c aac ag g aatc ac at 

VCAACCCAACAGCAATCACAT 



CATC AAACAGG ATC C AAC AAC C CAACAGG AA : CACA . 
TACGA^CAGGATCT.-AT.^ACCCCACAGGAATTCC'- 
— CTCCACCACACACCATCAAATAACCCCACACG~~ - - — 
;AAACAGCATCCAATAACCCCACACCTr 



^ATCCAACAACCCCACACGTATT-CAT 
. . .ACGA/^CAe-ATCCAAT.^CCCCACAGGTATTCCA^ 
rCTACGAAAC-^reT^T^CCCCACAGj.C^.-^ 

t:a:wa:a -atccaac^ccccaca^.^-^^-.— 
■cca ocaaa :a :-at7caa :aa 2CCCacacc^-w 

'""AAA" 7 A ;-7A 7 OCA A 7AA 7CCCA Aw-.-A . - - -~ ~ 
7CA.7GAAATA 3.:ATCTAA-7AA7TCCA :a JOCAT-^^ 
'A 7AT0AAACA 7GA 7~" A 7AACC'7C~ 7A 7 GAA 7 . - ■ 
llllTel CAGCcScSSa^

S5:2: ««*"^^ 20 

ia 9e . au S CAGCGGTCCAC ~ AC I A :i:;^ = * a 

lace obs CAGCCGTCCACCTAC7AT-rCC.A^AC^~^C.^^.v.-^AC.^^-^ 

iage .oc,, rac^XCTA-rCC-AC^CGAAACAGGATCCAACAACCCC^CAGv,^. iw-A- -4U 

1 , S SO . bo r ^^QXt^ixScC^^^G^C^ACACGAGACAGGArCCAATAACCCCACAGGAATTCGAT 2 4 0 

l.sscper t^gctCTcScCTGCTA^CCTACACGAAACAGGATCCAATAACCCCATAGGAATCCCAT 2 0 

glo .mac 1 AGl - 1 GTL _ ZlCt^X - - rirr A-eCAATAACCCCATAGGAA7CCCAT 240 

5lo.mel T^GCT ctc«cctw^a-.u ^ C ^ AATAACCCCACAG GAATCCCAT 240 

£--e a = C TAGCTG7TCACCTGCTA i XCr rt CACw^^CAGG«-w.^-^.uv. 

neoo'ele TAGCTGTCCACCTGCTATTCCTACACGAAACAGGATCCAATA^ -.0 

pepo.ele TAGC^GTTCACCTGCTATTCCrACACGAGACAGGATCCAATAACCCCACAGv,^^^^- --tO 

gram.gri TAGCTGTT MV^.- ^r- vir- <■"- A^rrAA^AACCCCACAGGAATCCCAT 240 

r a nrrirmc CTAC~ ATTC CTACACG AG ACT Gwi - <- i_Hu->. - .twaa- ^— 

P3 e . era ^^^Xl^^cAC ^GC^G^CCTACACGAGAOAGC-ATCCAATAACCCTACAGGAATCCCAT 240 

lage . acu Sg^Sc^AC^ J J 

orca.brs TA^C-GTTGAC*--. -.v. , rr A A~ AAC C C CACAGG AATC C CAT 240 

ill * c-o CAGCCGTTCACCTGCTATTCCTACACGAAACAGGATCC^^ 

5 ,5"? CiGC-GTrCACCTGCTATTCCTACACGAAACAGGATCCAArAACCCCAC-G^^C^ - * J 

d s ZZ: d l^ cagccg^Sggtgctattcctacacgaaacac^ 

cur . adu MC0BK ^ 2 4 ° 

saus.cii rAG ^ Cj ^^^ ^ 2,0 

seen. lor. O 00 ?™ 3 ^^ 240 

mono . men r^-CCGTCCACT .A- ..-w - - ,_ ri ,, CAAC — CACAGGAATTCCA . 2 4 0 

cla- can ^^ A - GCACCTACTA ^:.:I A ^ -° 

•, a - ^- CAGTTATCCACCTACTATTCCTA^C^^CAGw^^-W--^---^ 

? - a " - r . ,^- TCC ACC^CT^^^CTCGACGAAAC.AGGATCGAACAAC__C«-.^w^---^-- 

, 5 riATAA^^CACCT^C^ATTrCTCCACGAAACAGGATCCAACAACCC-v.^-^ — — 

Jccgi.Sim C^ATAAi-CA t - l -'- v -"^ — -~~^AACAAC'*CCACAGGAATTCCCT 240 

i ioo . vex caaccgtccac ^ acta II;^S-a^ 2 4 0 

phoe .sin TAATCGTCCATCTA " A 2.^ 240 

he-a ba' CAGCCGTCCACTTACrATTCCiCCACGAAACAGwA^-w^C^-- ^ 

5?;J'«: CAGCCGTCCACTTACTATTTCTCCACGAAACAGGATCT.AATAACC ^f^l £^1 ^ 

- ?h - Ca - r , ,_ rr -^ rAC ™ AC -^TCTCCA-GAAACACGA~CCAArAACCC.^C rt G^^^^-A^ j 

meso.eur . O^iCGTv-CAC i . , — •i—iiCCTACAGGAATTCCA . 240 

me so . bid CAAi.CGTv.CACv- <.«C , , — aa^AAC - -TACAGGAATCCCAT 24 0 

meso -den CAATGCTCCACCTACTATTCC^C.A.C^C^^. - ^"^^ OGAArTCCAT = 40 

hyp, . amp ^i^i^SE^It:^^^ - * 0 



c-sra 

bab/. bab 



:u3 . bar 

3 li J - 3 



CAATTGTCCATTTACTATTTCTACACGAAACAGGATCTAATAAT--- ^GGAATCTCA": 
TTATACTCCACCTCCTATTCCTACACG^ 



240 



poac .bu TTATACTCCACCT ^;:^ : ' ;0 

«c. lib CCGCCCTCCACCTACTOT 



hipp . amp 

die . surn ^aa 



CCGCCGTCCACCTACT^ - — ^ - -~ - - ArAGGAA: 

C-A-r C ^--AT^TACTATTCCTCCATGAAACAGCATCCAACAAC^-^CAGU^ 

CAATTACCCACCTGCTA7TC2 , ^^C^^.^ . ^^IZZ- ^"-^^GAATTCCA' 

CGATCACCCACTTACTATTCCTACACGAAACA^GA^--^^^^^-^^ 



CAATGACCCACCTACTATTGGTTGACGAAACAGGATG^ 
TAATCGTGCATCTACTATTGG.C^^^A^w^^.^-^^; 

CAACCCTACAT-TATTATTGGTTTf^^s*A#*A*-. , 7 , 1, _Z-- A , ^GAATGTCAT 

CAACCCTACATCTGTTGTTGGTACAGGAAAC : GCA : 'ZTAAGAA^^.-^ 1^1^^ : A "T^AT 



=,:= ; cagccgtacatcttcta tt-tata-aaa ----atccaataa... 



f-wb ) 



CAGGGGTACATGTG-2T A :TGG . ^ k - - ArA ^' " 



_ •« 'y 

240
lama.gla

lama . gua 

vie. vie 

cam. bac 

arc . for 

arc .gaz 

eum. j ub 

sal . cal 

odo.ros 

pho . f asciaca 

pho.gro 

pho .vit 

cys.cri 

hyd.lsp 

lep .wed 

mir . leo 

eri .bar 

mon . sch 

hela.mal 

sel.fcSii 

ail.fful 

f el . 

car* 

cal 

gla . sab 
gla- vol 
byl.pha 
?e:.se: 
bel .pea 
pt-.mcm 
gala.demi 
pero . pec 
gala .ma" 
gala . rr.oh 
oro.gar 
Icr - car 
nyc - ecu 
mus 
gorr 
he mo 
dug .dug 
ele . max 
af r -cor. 
pave- . muC 
rra.bl/ 
era . sa - 
cra . cob 
:ra . c 
arg -arg 
ca: - I 
c ro . era 
sym - r**; 
bam . c.^.o 
fra . fr.i 
ith.cri 

3 r*i i.t: i"". 



tcct 
rcrccr 



CAGGAGTACATCTACTATTTTTACA 

CAGCAGTGCATCTACT^^ 

CGGGAGTACATCTAC^^^ 

TAGCCGTACACCTATTATTCCTACACGAAACAGG 

TAATAGTACATCTGCTATTCCTACATC^ ^ ^ 

T AATAGTGCACCTACTATTCCTAC " ^ * A ' " ~ * 

TAATAGTACACCTATtATTCCTAC^v-w* - " , ,~, 

TAATAGTACACCTATTATTCCTAOKOGA^ 

CAGCAGTACACCTACrATTTCTCCACCAAACAGGATCTAACAACCC I 1 C^^.Cu^ 
CGGCAGTTCATCTACTATTCTTACACGAAACAGGATC CAACAAC C C C ACCGG AA i» CG t A * 

CAACAGTCCAOC^ 
CAGCAGTACATCTACTATTCrrTACAC 

CAGGAGTACATCTACT ATTCTTACACG AG ACAGG ATCCAACAACCC E^^'-rp^ r^T 
CAGCAGTACATCT^CTATTCCTACACGAAACAGGATCCAACAACCCCTCTGGriA CwC^wT 

CAGCAGTCCATTTATTATTTCTACXCGAAACAGGATCCAACAATCCCTCCGGA^^C^*^* 

SgIa^CATC^^ 
CAACTATCCATCTCTrATTCCTA^ 

CAATAGTACACCTCCTATTTCTACACGAAACCGG ATCCAACAACCCTTCA - * 
CTGGAGTTCACCTGTTATTTCTTCACGAAACAGGATCAAACAACCCA 

CCATAATCCATCTACTCTTTT^^ 

CCATAATCCATCTACTCTTrCTACACGAAACAGGATCCAAJAACCCAT 
CTATAATTCACCTTCT<-~i''IVC7ACACGAAACAGGATCAAATAACC 

craraAicaccrrcrc^^ 

CAATAGTCCACCTTCTTTTCCTCCACGAAATT^ 

CAATAGTTCACCTACTTTTCCTTCATGAAACAGGGTCCAACAACCCATC i vjw 

"CTATTCCTTCACGAAACAGGATCAAACAACCCCrC^GvjACT. 



CT ATC AT AC ACCTC ATCCTCTT AC ATC AA v CAGGC 



7CA7 

CCACTSCCCATCTCAT 

CCArCATCCACCTCATCTTCCTACATGAATCAGGCTCTAACAA^^^^-^---^--^^ 
CCATCATCCACCTCATCTTCCrrACATGAATCP^r-- i -ACAAGr - . _ - — - 



CCATCATCCACCTCACATTCCTACACSAATCACGC^ 
CCATCACCCATCCCATATTCCTACATG^TCAGC^ 

■""TACACGAATGAGGATCAAACAACGG^CTAGGCATCTCA 




CCATCACACATCTTAT^TT" 

CCATTATCCACCTCACATT'- . - - . - ■ - - - - - _ — 

CTATCATCCACCTCACATrrCTGCACGAATCAGGCTCAAACAA — ^jr;^ 

ctgtcatccaccttacactcctccacgaatcaggtt-aaataa 3 - -AC.-- - " 

C-CTAATCCACCTCACCTT-CTTCACGAGT'— ™— A^CAAC -G^O* . 

:ctt' 



CCCTAATCCACCTTA 7CT" 



"ZAAATAA3* 
rCGGCTC AAACAA c 
CACGAATCC^GCTCAAACAA rTTCCTAGGCA"! 
•CAC^AA7CCGr^C7CAAACAA"C~CTAOGCA: 



TCATAATCCACCTCCTATTCCTTCACGAAACAGGA7 C.AAACAAU ~ jrTJ^TT t-^^cA^ 
CCACAACTCACCTC^ 

ccATAATraccTTrrr^^ 
ccAXAATTCATcrrcTTrrrrr 

TCATAATCCACCTCCITrrCCTCCAro 
CTGCAATTCACCTACTTTTC CTACACGAAT CAGG ATCT 
TTGTGATTCACCTCATCTTrCTACATGAAACAGGCTC; 

CAATCGTTCACCTCCTCTTCCTCCACG AAACAGGATCAAACAACC^^ _ 

CAACCCTCGATCTCCTATTTCTACACGAAAGAGGATCAAACAAC . TC - "^^^^^ 
CAACACTCCACCTCCTATTCTTGCACGAAACGGGATCAAACAACCCCCTAG^AA^^^u^^ 

T AATAGTCCACTT ACT ATTCCTCCACGAAACAGC CTC CAACAAC C C ^^^^^^^.^V^^l 
CACGACTGC^CCTAACCTTrcrrTCACGAAACAGGCTCAA^CAAC -^^^^,1^ T Z 
CAATTATCCACCTCACATTCCTTCATGAATCAGGCTCAAACAAC 
CAATTATCCACCTCACATTCCTCCATGAATCAGGCTCAAATAA - - -^^^ 
CCATCATGCACCTCATCTTCTTACATGAATCAGGCTCTAATAAC . ,mC - 
gru.ant.gil
gru.ant.sha
gru.leu
gru.can.pra
gru.can.row
gru.can.tab
gru.can.can
gru.ame
gru.gru
gru.mon
gru.nig
gru.jap
cic.boy
rhe.ame
ant.alb
fal.fam
fal.ver
fal.per
fal.s?a
ayt.ame
smi.sha
vid.cha
chry.pic
emy.orb.kur
che.mud
eum.egr



aep . 
ore . 
add, 
ory . 
hi?, 
ale. 
sig. 
be a . 
dam . 
con . 
aran . 
pse . 
cap . 
hem . 
cap . 
rup . 
run . 
nem . 
bud . 
pan. 
ov i . 
ov i . 
cap . 
ov; . 
ere . 
c^p . 
cep . 
bis . 
boi . 
boi . 



mel 
ore 
Has 
dam 
equ 
bus 
lie 
hun 
lun 
cau 
ier 
nay 
j em
fal 
?yr 
ru? 
cau 

ca;< . tax 

hod 

a mm 
CCCTAATCCACCTCACCTTCCrrCACCAATCCGGCTCVVACAACCCCCTACMCATC^A.
CCCTAATCCACCTCACCTTCCrrCACGAATCCGaTTCAAACAACCCCCTAGGCATCG.A, 
CCCTAATCCACCTCACCTTCCTTCACGAATCCGGCTCAAACAACCCCCTAGGCATCG.^. 
CCCTAATCCACCTCACCTTCCrrCACGAArCCCCCTCAAACAACCCCCTAGGCA^o^ 
CCCTAATCCACCrCACCTTCCTTCACGAATCCGGCTCAAACAATCCCCTAGGCA^^^ 
CCCTAATCCACCTCACCrrTCCrrCACGAATCCGGv. i CAAACAACCCCC7AGGCA^« ^. 
CCTAATCCACCTCACCTTCCTTCACGAATCCGGCTCAAACAACCCCCTAGo^^ i . w . 
CCCTAATCCACCTCACCTTCC7CCACGAA7CCGGCTCAAACAACCCCCTAGGCATCG7AT 
CCCTAATCCACCrCACCTTCCTTCACGAArCCGGCTCAAACAACCCCCTAGGCATCGTAT 
CCCrAATCCACCTCACCTTCCTCCACGAATCCGGCTCAAACAACCCCCTAGCCATCGTAT 
CCCTAATCCACCTCACCTTCCTCCACGAATCCGGCTCAAACAACCCCCTAGGCATCGTAT 
CCCTAATCCATCTCACTTTCCTCCACGAATCCGGCTCAAACAACCCCCTAGGCATCGTAT 
CCCTAATCCACCTCACCTTCCiTCACGAGTCCGGCTCAAACAACCCCCTAGGCATCATCT 
CTCTTATCCACCTCACCTrCCTACACGAAACCdGGTCCAACAACCCCTTAGGAATCGTAT 
TCCTAATTCACCTGGCATTCCTCCACGAArCAGGCTCAAACAACCCACTAGGC\rCACAT 
CC^AA^CCACCTCACCTTCCTACATGAATCAGGTTC^AACaACCCCCTAGGAATCACAT 
CCCTAATTCACCTCACCTTCCTACACGAATCAGGTTCAAACAACCCCCTAGGAATCACAT 
CCCTAA^CCACCTCACCTTCCTAGATGAATCAGGCTCAAATAACCCCCTAGGAATCACAi 
CCTTAATCCACCTCACCTTCCTACATGAATCAGGTTCCAACAACCCCCTAG^ 

CCCTAGTCCACCTAAC i - ^...w^C^C^ewLiw^WAt. — "12?,,—, tv—v-* - 
CACrCATCCATCTCACCTTCCTCCATGAAACAGGTTCAAACAACCC I CTAGG ^T^.C.^. 
CTCTAGTCCACCTCACATTCCTACACGAAACAGGATCAAACAATCCAATAGGAATTC^- 
C^TAGTACACCTACTTTrTCTACAXGAAACTC-GATCAAACAACCCAACAGGA^A^-. 
CAATAGTACACCTACTCTTCCTACACGAAACCGGArC^CAATCCAACAGGA^^^ 
CAGCAGTACA7CTATTATTCCTGCACGAAACAGGATCAAACAA.CCCAACAGGAi iAA.^^ 
CAATAATTOACCTACTATTTCTTCACGAAACAGGATCA.AATAACCCAACCGGACT.-^- : 

CAGATTCAG™ 
CAGACACAGACAAAA^ 

CAGACACAGACAAAATCCCATTCCACCCTTACTATACCATTAAAGACA - ^J^j^^^^^Z 
CAGACACAGACAAA^^ 

rrccAC c c ctact acac cattaaag acatt c : ~g^c *c*.~ 



do c 
ma;c 
boa 

t :a 



CAGACTCCGATAAAACCC— , 

C AG ATGCAG AT AAAATT C C ATT C CAC C C CT ACTACAC ^3 5^X5^ 
CACATCCGCACAAAATCCCGTT TCACCCCTACTACACTATCAAAGACGCC^AGG^^ 
CCCACACCCATAAAATCCCATTCCCCCCCTATTACACCAC^^^ 

C AG ACGCAG ACAAAATC C CATTC CAC CCTTACTACACCATCAAAG ATA^^C^.^G^^G^A 
CAGACACAGACAAAATCCCATTCCACCC^TACTACACCATTAAAGA.A^^G^C^^^ 

CAGACACAGACAAAATC CCATTC CAC C C CTACT AC AC CATTAAAG ^^^^ZZ^ ~r-C 
CAGATACAGACAAAATCCCATTTCACCCTTACTACACCATTAAAGATAT * i ACGC^A 
CAGACACAGACAAAATCCCATTTCACCCTTACTACACCA— AAAGATATCC^G^^C^ 
CAGATGCGCATPJXAATCCCATTTCACCCCTACTATACCATTAAAGACATTC ; ^G^^ : A 

CAGATGCGGACAAAATCCCATTT-rACCCCTATTATACCATCAAAGACATTC * — 

C AG AC AT AG ACAAAATC C C A' 
CAGATGCAGATAAAATTCCAT 
CACATGCACACAAAATCC 
CCCACACAGATAAAATTC CC 
CCGACACAGACAAAATCCCC 
CAGACACAGACAAAATCCCA 
CAG AC ACGG ACAAAA TC C C A 
CAG ACGCAG ACAAAATTCCAT 
CCG ACGCAG AC AAAATT CCA 
C AG ACGCAG AC AAAATT CCC 
CAGACACAGACAAAAT' 
CAGACGCAGACAAAATTTCA'! 
CACACGCAGATAAAATCCCA": 



_ . .. ATT AT AC AATC AAAG AT ATT CTAGG CGCT A 
kCCCTTATTACAC CATTAAAG AT ATCC . AGGAG rCA 
rCCCTACTATACCATTAAAGACATCCTAGGCGC^A 
_ ^CCCTTACTACACCATTAAACACA .CCTAGGTCC^A 

crr?i?r>oni7 , r?r>ncrj fti'kiTf att aaag acattc - cggtgcca 

CCACCCCTACTACACAATCAAAGATATCCTAGGCATC^ 
CCCTACTATACAATCAAAGACATTCTAGGCGC— 
CACCCCTACTATACTATTAAAGATATCC " ACCCCC--A 
CACCCCTACTACACCATTAAACACATCCTAGG.-^- 
CACCCCTACTACACTATCAAACACATCCTAGGCGCC- 
CTTACTATACCATTAAAGACATCCTAGCACC^ 
'ACTATACTATTAAAGACATCTTAGGAGCCT 
'ACTACACTATTAAAGA'CATTC . AGGA :CCC 
bub . min
buba . bub
kob .ell
red. ful
pel .cap



gac 
our 



.'dam 
. our 



ant: . cer 
sai . cac 
rcad . kir 
rap . mel 
ca: .can 
ane . ame 
hyd. ine 



ale . ale 
cer. ela.kan 
cer . ela . ;<ar. 
cer .-la. can 
cer. nip. cent 
cer. nip. yes 
cer .nip.ker 
cer.nip.pul 
cer. nip. nip 
cer. ela. sec 
cer .dam 
ran. car 
mcs . fus 
mos . leu 
mcs . chr 
tr.es . ber 
bala . ber
bala . edi
Li-r;o p^ r



CA»CA«CACAAAATCCCATTCCACCCCTACTACACCArrAAACACA*^^ 
CACACAO\GACAAAA7CCCAT7CCA^^ 

CACACATACACAAAAT7CCATTCCACCCCTA i t ACACTATCAAGGACATC^Abw^u 
CHAACATAGACAAAATTCCATTTCACCCTTACT^^ 

CAGACATAGATAAAATCCCATTCCACCCCTA^LACACCArC.AAAGACA^C^^^^w^ 

CAGACACAGACAAAATCCCATT CCACCCAC ATTATACCATCAAACA - - 
CAGATGTAGACAA-^7CCCArTTCATCCATACTArACTArCAAGGACG^C,A^^^^ :: 

CAGAYATTCACAAAATCCCArTCCACCCNTACTACACCArCAAAGA'/AT: CTAUw 
CAGACGCAGACAAAATCCCATTCCACCCCTACTACACCATTAAAGACATT^- * mwU.., 
CCGACATAGACAAAATTCCATTCC^CCCATACTACACCATTAAAGATA^^CTA^u vjv-~ . 
CAGATGCAGACAAAA-~CG-T CCACCCCTACTACACCArCAAAGACArTCTAGGAGCAC 
CAG ATGCAG ACAA3GT C C CATT C C AC C C CT ACT ACACC ATTAAAG ACATC CT AGGCG CCT 
GAGACGCAG ACAAAATTCCATTCCAC CCCTACTACAC I AT CAAAG AT AT C C ^AGu^uC - - 
C AG ATTCAGACAAAATCC CATTCC AC C CCTACT ACAC CATTAAAGACATTCTAGG C GCCC 
CAG ACG CAG AC GG AAT C C CATTC C GC CC CT ACT ACACT ATT AAAG ACATC ZT^JZGCG CCC 

- " CT A CTACAC CATTAAAG AC A7TTTAGG AG CCC 



CAGACGTAG ACAAAATT C CTTT C CAT C C CT ACTATAC ^ATTAAAGATATTTTAGG^Gv. w 
CAGACGCAGACAAAATCC CATTT CAC C CTTACTACACTAT ^*' 2l ^^^^^^^^^l^^ ^TZ^Z 
CAGACGa^GAC^ATCCCCTTCCATCCTTACTATACCATT. AAA.G^^-G^^ 
CAGACGCAGACAAAAT CCC CTT C CATCCTTACTATACCATTAAAG ATAT^^rt^*^ * CT 
CAGACGCAGACAAAATCC C CTTCCACCCTTACTATACG ATTAAAGATA^C^aGw ^ a- ^ 
CGGACGCAGACAAAATC C CCTTC CAT C CTTACTACAC ^^"^^^ ^^^^^^^^TiX^ 
CGGACGCAGACAAAATC C C CTT C CATC CTT ACT ACAC CATTAAAG.-* - a* ^- ~ ^?.1Z~ 
CGGACGCAGACAAAATCCCCTTCC^^^ 

C GGACGCAG ACAAAA.TCC C CTTC CAT C CTTACTATAC ^^^^^^^^3^*1^ X^g^^^C^ 
CGGA.CGCAGACAAAAT C C CCTT C CATCCTT ACTATAC CATTAAA.G ATAT\- Cj. ^Z^Z 
CAGACGCAGACAAAATCCCCTTTCATCCTTATTATA.CCA * •^^"^"3"^3^^C^r\^r ^ 
C AGATGTAG AT AAAATTCC CTTTCA7CCCTACTACAC -^-"^^^^^^333^^ ^^^^ 
CAGACTCAG ATAAAATT C CATT C CATCCCTATT ATACT AT CAAAGACA - i ^^^X^- X Z 
CAGATATAG ACAAAATCCCATT CCACC C CTACTACACC ATCAAAGACA^CT^G*^^^- 
C AG ATATAG ACAAAAT C C CATTC CACC CCTACTACAC CAT CAAAG A - 
CAGACATAGACAAAATCCCATTCCACC^^ 

CAG ACAT AG ACAAAAT C C CATT C CAC C C C TACT AC ACT AT ^*^^^^5^Z3^3 v~ <~ C 

CACACGCAGACAAGATCCCCTTCCACCCATACTACACCATC^AAGA ^^^Z 
CT^ACATAGACAAAA^CCC^TTCCACCCCTACTACACAATCAAAGACATTC CAG^^v- - - 
CTGATAT AG ACAAAATCCCATTCCACCCCTATTACACAAT CAAAG ACATTCTAGGCCCCC 

ccgacatagacaaaatcccattccacccttact; 

CCAACATAG ACAAAAT CCCATTC CAC CCCT A . „ ACACAAC1 
CCAACATAGACAATATCCC^TTCCACCCCTATTACACAATTAAAGACA. 

CTG ACATAG ATAAAATTCCATTCCACCCCT ACTACACAAT ^ 
CCAACATAGACAAAATCCCAT I C CAC CCT I ACCACACAA ^ 
CCCACA7AGATAAAATCCCATTCCACCCCTACCACACAA Z . AAA3ACA 

CCAACATAGACAAAATTCC 
CCAACATAGACATAA 



CCAACATAGACATAATCCCATTCCACCC 
CCAACATAGACATAATCCCATTCCACCC 
CCAACATAGACATAATCCCATTCCACCC 
CCAACATACACATAATCCCACCCACCC 
CCAACATACACATAATCCCA 7TCCACCC 
CCAACATAGACATAATCCCATTCCACCC 

CCAACATAGACATAA 77CCA 7 ^'^ llZ 
CCAACATAGACATAA 7TCTA 77 C -A ICC 



TATTACACAATTAAACACACCCTACCCCC^ 
TATTACACAATTAAAGACATCCTAGGCGC^ 
TATTACACAATTAA^ACATCCTACOCCC^ 
CATT A CACAATTAAAGACATCTTAGGCGC^ 
TACTACACAACTAiAGACATCCTACGCCC^ 
TACT A -A ZAATTAAA 3ACATCCTACvCC3CT 
TATTA 7A CAATTAAA3ACATCCTAGGTCCT 
TA 7TA TATA ATT AAA 3 A - A T C CT C C C C T C 
T ATT AT A TA A 7TAAA : A 7A 7CCTCGGCCCT 
CC^CATACA^TAATTCCATT^

CCAACATAGACATAATTCCATTCCACCCCTATTATACAACTAAAGATATCCTAoG : o :;: . 
CCAACATACACATAATTCCATTCCACCCCTATTATACAATTAAACACA^v^AoGwvj^^^ 

CCAACATAGACATAATTCCATTCCACCCCTAT^ACACAA^^^ 

CCAACATAGACAT AATTCCA . . CCACC~ - - A . . ACACAA - ; X^G*^^^^^-^ 
CTAACATAGATATAATCCCGTTCCACCCTTATTATACAATTAAAGATATC^^rt^^^^^^ 

CCAACATAGATATAATCCCATTCCACCCTCTATCACACAATT.^^ 

CCA^C^TAGACATA^TCCCATTC^CCCTTArCArACATTTAAAGACATCO^GwC^^ 
CCAATATAGACATAATCCCVTTCCACCCTTA - TA T AC.AATCAAAGArAi CC^.^^^^-- - 

C " AAC AT AG " C AA — " r ^ CACCTTATTATACAATCAAAGATATCCTAGG j. Gv-CC 

CCAATATAGACATAATCCCATT^CCACCC^ 

CCAAT ATAG A CATAAT C CC ATT C CAC CCTT ATT AT AC AAT ^ o~ 

ccaatatagacat.aattccattccacccttattatacaattaaagata,cc^. : g^ : ^^ 

CCAATATAGACATAATCCCATT^^ 
CCAATATAGACATAATCCCATTCCACC^^^ 

CCAACATAGACAT^-CCCATTTCACCCTTATTATACA-ATCAAAGACATCCTAC^.^^, 
CCAACATAGACA^AATCCCATTCCACCCTTATTATACAATCAAAGACATCCTAGG iG\jC t, 

Sa^aSca^a^ 
ccScSagStSt? 

CCaACATGGATACAA^CCCATTCCACCCCTACTACAC^ATCAAAGAaAiCCT.-xGv, 4 CC^j. 
CCGACACTGACAAAATCCCTTTCCACCCCTACTACACAA^ 
' CCGACATAGACAAAATCCCATTCCA^^ 

CTAACATAGACAAAATCCCATTCCACCCCTACCACACAATTAAAGATA- ^ . ;^«^^-= 

SSStagacataatc^ 

CCAATATAGATAAAATTCCATTCCACCCCTACTAT^^^ 

ccgatatagacaaaatcccattccacccttattacac* ? - - r -- ri - 

CTGATATAGACAAAATCCCATTCCATCCTTAC 

CCGACATAGATAAAATTCCATTCCACCCCTAC ... - - - - _ „ 

CTGACATAGATAAAATTCCATTTCACCC^^ 
CTGACATAGACTAAAATCCCGTTCCACCCATACTACACAAT^AAAG 

CTCACATAGACAAAATTCCATTTCATCCTTA^ 

CTAACATAGATGCCATCCCATTTCACCCC - ACT ACACAA- - AAAw^A . - 
CAAACGCAGACAAAATCCCATTCCACCCCTATTAC»CAATCAAAwa-A^^ 

CAAACGCACACAAAATCCCATTCCACCCCTATTACACAA^ 

CTAACATAGACAAAAT'"'"' " " 1 ~ "~ ' - 



CTAACACAGACAAAATT 
CCAACATAGACAAAATCCCA 
CTCACATAGACAAAATCCCA 
CAGATATACACAAAATCCCA 
CAGACATACACAAAATT 
CACACATACACAAAATTCCA 
CAGACATAGACAAAATTCCA 
CCGATATACACAAAATCCCC 




CGCATATACAGAAAAT' 
CAGATATAGACAAAATTCT-r 
CAGACATAGACAAAATTCCA 
CTCACTCACACAAAATCCCA 
CTCACTCGCACAAAATTCCA 
CCAACTCACACAAAATTCCA 

CTCACTCACACAAAATCCCA 



CACCCTTACTACACAATCAAAGACATCCTAGGAGCCC 
TA CTACACAATCAAACACATCCTGGC AATTT 
CACTACACAACTAAACACATCCTAGCACTTC 
TCACCCCTACTATACCATTAAACACATTCTAGCAGCCC 
TATTACACCATTAAACATATCCTAGCACCCC 
TCACCCATArTACACTATCAAAGACAr-CTAGGAGCC^ 
TCACTCATA CTACACTATTAAAGACATTCTAG^G.- . 
CCATCCCTA ^g^^g*"AC 

CCA "CCTA :taca caattaaacacatcctagcac..^ 

TCACCCATATTATACAATTAAAGATATTC.^s,^.--- 
CCACCGATATTATACAA7TAAAGATATCCT-.'.r^---- 
CCATCCATATTATACAATTAAAGATATCCTCCGAA.--- 
CA TCCA TA ~.\ TA T AATTAAAGATATCCTAOvAA - - - 

-a tcccta :ta :a taa ttaaa :atatgcta«;~cctca 






pho. fasciaca 
pho.gro 
pho . vie 
cys . cri 
hyd. l«p 
lea .wed 
mir . leo 
eri .bar 
men. sch 
hela .mal 
sel . chi 
ail.ful 
fel 
can 
tal 

gla . sab 
gla. vol 
hyl .pha 
per . sec 
bel.pea 
pes. mem 
gala . demi 
pera . poc 
gala .mac 
gala.moh 
oco .gar 
Icr . car 
nyc . cou 
mus 
gcrr 
hemo 
dug . dug 
els. max 
afr. con 
pavo . muc 
era .bly 
era . sac 
era . cob 
era. csm 
arg.arg 
cae . wal 
cro . cro 
svm . rse 
bam . eho 
fra . f ra 
i e h . c ru 
ar.c .?ac 
a n e . v i r 
gru . ar.e . ar.e 
gru . an: . gi I 
gru . ar.e . 3 ha 
gru . I^'u 
gru . car. . p r a 
g ru . can . row 
g ru .can . eab 
g ru . can . can 
gru am*; 
gru . gru 



CCGACTCAGACAA^TCCCA— CCACCCATACTATACAATTAAAGATATCCTAGoAww^ 
CCGACTCAGACAAAATCCCGCTCCACCCATATTATACAATTAAAGATATCCTAGGAG^u 
CCAACTCAGACAAAATCCCA— CCACCCGTACTATACAATTAAAGATATCCTAGow^C^ 
CCGACTCACACAAAATCCCATT CCACCCATACTATACAATTAAAGACATCCTAGGAG^v.v. 
CCAACTCACACAAAATCCCATTTCACCCCT^ 

CTGACTCAGACAAAATCCCATTrCACCCCTACTACACAArCAAAGACA^^^GoAu^^v. 

CCG ACTCAG ACAAAAT CCCXZZCZACC CAT ACTA CACAA T CAAAG AT AT CTTAGG Au C CC 
CCGACTCAGATAAAATTCCATTCCACCCATACTATACAGTCAAGGACATCrTAGGGGC^ 
CCAACTCAGACAAAATCCCATT CCACC CAT ACT ATACAATTAAAG AC ATTCTAGGAGC . l 
CTGACTCAGACAAAATCCCATTTCACCCGTACTATACAATTAAGGACATCCTAGGCGCCC 
CCAACTCGGACAAAATCCCATTTCACCCATACTATACAATTAAAGACGCCCTAGGCGCCC 
CCAACTCAGACAAAATTCCATTCCATCCCTATTATACAATTAAAGATATCTTGCGCGCTC 
CCGA^CAGACAAAATCCCATTCCACCCATACTATACAATCAAAGACATCCTAGGTC . . C 
CAGACTCAGACAAAATTCCATTTCACCCTTACTACACAATCAAGGATATCCTAGGAGCCT 
CAG ATACGG ATAAAATTCCATTT CACCCCT ATTACACTATTAAAGACAT C CTAGGAGCAC 
CTG ACTCAGAT AAAAT C C CACTCCAC C CTTATTTCTCAATTAAAG ACACCCTAGGA^*-^* 
CTGACTCAGACAAAAT CCCATTCCAC C C CT ACTTCTCAATTAAAGATACC^AGG A . ^ ^ 
CCGATTCAGACAAAATCCCATTTCACCCATACTATTCAATTAAAGATC I CCTAGwC^Cw 
CCGATTCAGACAAATTCCCATTTCACCCATACTATTCAATTAAAGATCTCC 

CTGAATCTGATAAAGTACCATT C CAC C CAT ACTT CACAAT ^^^^^^^IZ. ^ X T X 
CCGAATCCGACAAAATCCCATTCCACCCC^^ 

CAGACTCAGACAAAAT CCCCTTTCACCCCTATTACATAATCAAGGATCTCCTAGvxAC jl ^A 
CAGAATCAGACAAAATCC CCTTCCACCCCTACTACAC CAC ^ 
CAGACTCCGACAAAATCCCATTCCACCCCTAra^ 
CAGACTCCGACAAAATCCCCTTCCACCCCTACT^^ 

CAG ACTCTGACAAAATCCC CTTC CAC CCCT ATTACACAATTAAAGACCTTCTAGwG w ^ 
CAGACTCTGACAAAATCCCATTTCACCCCTACTACACATTAAAAGATATTC-^^^^G^^^ 
CAGACTCAGATAAGATTCCATTTCACCCCTACTACTCACTTAAAGACCTCCTAGwAG.G^ 
CAGATGCAGATAAAATT CCATTT CAC CC CTACTATACAAT CAAAG AT ATCCTAGGTA^ w >~ 

CCCACTCraXCAAi^TCACCrTC 
CCCATTCCGATAAAATCACCTTC^ 

cagact^ 

CCAATTCAGATAAAATCCCATTCCACCCGTACTACTCCCT CAAAGATATCC,A^^-- 
CCAACT»GAC«AATTCCGTTCCACCCATACTACTCCCTCAAAGATATC-.AG^^^ 

CTAACTCTGACAAAATCCCATTCCACCCGTACTACTCCCTC^ 

CCAACTCTGACAAAATCCCAT^ 

CTGACTCTGACAAAATCCCATTCCACCCGTACTACTCCCTC^GATA,w^GwwC ^ 

CTAACTCTGACAAAATCCCATTCCACCCGTACTACTCCCT CAAAG AT/^^C^^^^^^^ 
CT^CTCTGACAAAATCCCATTCCACCCATACTACTCCCTCAAAGACA^^-G^C^^ 
CT AACTCCGACAAAATCCCATTCCACCC AT ACT ACTCCCT CAAAG ATATCC ;ft ^^^ 
CTAATTCCGACAAAATCCCATTCCACCCCTACTACTCCCTCAAAGACATC^AG^^-^^G 
CTAACTCTGACAAAATCCCCTTTCACCCATACTACTCTCTCAAAGATA^-.-^^^ 
CT AACTC CG AC AAAATC C C ATTC C ACC C ATACT ACTCCTTT AAAC ACATT CT ^ V 
CTCACTCTCACAAAATCCCATTCCACCCATAC . ACACCCTCAAAC~w^-^~ ^ 
CT AACTCTG ACAAAATCCCATTTCAC CC AT ACT ACT CCCT C AAAC AC A . . ^7--^^ 
CAAACTGCGATAAAATCCCATTCCACCCCTATTTTTCCTTAAAAGATA • ^^^-^ 
CAAACTCCGATAAAATCCCATTCCACCCCTATTTTTCCTT.=v.AAAGATATCC ~ | 

CAAACTCCCATAAAATCCCATTCCACCCCTACTTTTCCTTAAAAGATATC 
CAAACTCCGATAAAATCCCATTCCACCCCTACTTTTCCTTAAAACATA *-~^t~X2 
C AAACTCCC AT AAAATC CCATT C CACCCCTACTTTTCCTT AAAAG AT A^^C j^GG 

CAAACTCCCATAAAATC-r^ATT^TACCCCTAC Z Z „T AAA^^ A . ~ ~ 

CAAACTGCCATAAAATCCCATTCCACTCCTATT TTTCCTTAAAACATA ^2 
O\AA^T0CCATAAAAT^^CATTCCACCCCTATTTTTCCTTAAAAC ATATCC • ^^J.^^ 
CAAACTGCGATAAAATCCTATTCTACCCTTATTTTTTCTTAAAACATATTCTAGC 
CAAACTCCGATAAAATCC-ATTCCACCCCTATTTTTCCTTAAAAGATA.C^^A^o^^^A 

CAAACTGCCATAAAATCrCATTTrACCCCTATTTTTT-TTAAAA^ACATTC^o^^^^ 
kTAAAATCCr* 7TTCACCCCTA TTTTT TTTTAAAAG ATATT - . -~ 
CAAACTCCGATAAAATTCCATTCCACCCCTATT^

CAAACTGCGATAAAATTCCATT CCACCCCTATTTTTCCTTAAAAGArACCCTAGGA^CA 
CAAACTGTG ATAAAAT C C C ATT C CAC C CCT ATTTTT CCTT AAAAG AT AT CTT AGC A > t ,A 
CAAACTGCGACAAAATTCCATTCCACCCCTACTTCTCCCTGAAAGATATCCTAGGCCTTA 
CTCACTCTGACAAAATCCCATTCCACCCCTACTTCTCCCTAAAAGATGCCCTAGGACTAG 
CCAACTGCGACAAAATCCCATTCCACCCATACTTTGCCCTAAAGGACATCCTAGGATTCA 
CAAACTGCGATAAAATCCCATTCCATCCCTATTACTCTCTC.AAACACCTCCTAGGA..CA 
CAAACTGCGACAAAATCCCATTCCATCCCTACTACTCTCTAAAAGACCTTTTAGGAGTCA 
CAAATTGCGACAAAATCCGATTCCACCCATACTACTCTCTC.AAAGATATCCTAGGATTTA 
CAAACTGTGACAAAATCCCATT CCACCCCTACTACTCTCTCAAAG ACCTCCTAGG . _^A 
CAGACTGCGACAAAATCCCATTTCACCCCTACTTCTCCTTCAAAGACATCCTAGGA^ . lA 
CTAACTC CG ATAAAATC C CATTCGACCCATACTTCTCCATAAAAGACATTCTAGGCTTTG 
CAGACTGTGACAAAATTCCATTCCAGCCATACTACACCACAAAGGACATCCTAGGCTTCG 
CAAACACTGACAAAATC C CATT C CA C CCTTATTTCT CAT AT AAAG AC CTTTTAGGCGTCA 
CAAACACCGATAAAATCCCTTTC CATCC CTACTTCTCATACAAAG ACCTATTAGG ACTCA 
CAAATACCGACAAAATCCCCTTCCACCCCTACTTCTCCTACAAAGACTTACTAGGACTCA 

CTAGCACAGATAAGGTGCCATT C CACCCATATTACACA 
:ax



TATTAATAATTCTAGTCCTAATACTCCTAGTACTATTCATAC C CGAC CT ACTAGGAGACC 
TATTACTAATTCTAGCTTTATTACTCTTAGTATTATTCAOA.CCTGACCTACTTGGAGACC 
TACTACTAATTCTAGTCCTCATACTACTAGTA.TTATTCACACCCGACCTACTTGGAGACC 
TACTACTAATCCTAGCCCTTATGTTGCTAGTATTATTCGCACCCGACCTACTTGGAGArC 
TACTACTAATTCTAGCCCTCATACTACTAGTACTATTCGCACCCGACCTACTTGGAGACC 
TATTACTAATCCTAGC C CTCATACTACTAGTACTATTCG C AC CCGACCTGCTCGG AG ACC 
TArTACTAATTCTAGCCCTCATACTACTAGTACTATTCGCACCCGACCTGCTCGGAGACC 

TACTACTAATTCTAGCCCTCATATTACTAGT ACTATTTGCAC C CGACCTGCTCGGAGACC 
TACTACTAATTCTAGCCCTCATACTACTAGTACTATTTGCACCCGACCTGCTCGGAGACC 

TATTACTAATT CTAGCCCTAATACTACTAGTACTATTCG CGC C CGATTTACTTGG AGAC C 
TGCTACTAATCCTCACCCTCACACTACTAGTACTATTTACACCCGATCTACTCGGGGACC 
CACTGCTAATC CT CGCCCTGATATTACTAGT ATT ATTTACACC CGAC CTACTCGGAGACC 

TGCrACTAATTCTrrGTCCTAATA— ^ 

T ACTACTAATTCTTGTC CTAATATTACTAGTACTATTTATAC C CG ACCTAC TTGGAGACC 
TACTACTAATTCTCGCCCTGATGCTACTAGTACTATTCACACCTGACCTACTCGGAGACC 
TACTACTAATCCTCACCCTTATACTACTGGTACTATTTACACCTGACCTACTCGGAGACC 
TACTACTAATCCTCACCCTCATACTACTAGTACTATT^ACACCTGACCTACTCGGAGACC 
TACTACTAATCCTCACCCTTATTTTACTGGTATTATTCACACCTGACTTACTTGGAGATC 
TACTACTAATCCTCGTCCTCATGTTGCTAGTACTATTTATACTTGACGTACTTGGAGACC 
TACTACTAATCCTAATCCTCATATTACTAGTACTATTTTCACC CGAC CTACTCGGAGACC 
TCCTACTAATCCTCACCCTCATACTACTAGTACTATTCACGCCTGACCTACTCGGAGACC 
TCCTACTAATCCTCATCCTCATGCTGCTAGTACTATTCACGCCTGACTTACTTGGAGACw 
TGCTACTAATCCTCACCCTCATACTACTAGTACTGTTCACACCCGACCTACTCGGAGACC 

T ACTACTAATCCTT ACCCTT AT ACTACTAGT ATTATTC ACAC C CGAC CT AC I .CGAC ACC 
TACTACTCATCCTCACCCTTATACTACTAGTATTATTCTCACCCGACTTACTCGGAGACC 

T ACTACTCATTCTAG CC CTAATAATC CT AGT ATT ATT CTC A C C C G ACTT AC i .CCACACC 
TATTACTTATTCTAGCCCTAATAATCCTAGTACTATTCTCACCCGACTTACTCGGAGA I C 
TATTACTAATTCTAACTCTAATACTACTAGTACTATT CGCACCGGACCTCCTCGGAGACC 

T ATT ACTAATTCT AG C C CT AAT ACTTCTGGT A CT ATT C AC AC C C 
T ACTACTTATTCTAGCCCTAATAATACTACTA CTATT CGCA C C CGAC 
TCCTATTAATCCTAGCCCTAATACTATTACTAC 
T ACT ATT AATCCTAGCCCT AAT ACT ATT ACTA C 
TACTATTAATCCTAGCCCTAATACTACTA 

TACTCCTAATCCTAACTCTAATACTCCTAGTACTATTCGCACCCGACC 

TACT ACTAATTCT AGTCCT AATACTTCTACTTTTA~ 
TCCT ATT AATCCT AAT ACT AAT ACTCCT ACT ACTA" 
T A CTGCTAATC CT AGTC CT AATG CTCTT A» " A TT A TT 
TACTACTAATCCTGGCCCTAACACTATTAGTA CTA TT 
TCCTACTAATTCTACTGCTAACACTCTTAGTTTTArT 



zctcggagacc 
:ggggacc 



•A~ CGCACCCGACCTCCTCGGGGACC 
ACTATTCACACCTGACCTCCTCGGAGACC 

TCTCGGACACC 
-GACCTACTTGGAGATC 



•GACCTAC" 
TATTACTAATCCTAATCCTAACA^^

XACTATTAATTCTAGCCCTCATACTCCTAGTTCTATTCACACCAGATCTCCTTG«Au^C^ 
TACTAATAATCTTAGCCCTAATAATACTAGTACTATTCTCACCA 

T ACTTCTAATTCTCTT C ^^ATA^^ ATTAGT ATTATTjXG^ CAG AC CTGCTTGGAG AC C 

TACTTCTAATACTCTTCCTAATATTACTAGTATTArrCG^ 
TACTTCTAGTACTCTTCCTAVTATTACTAGTATTATTCGCAC 

TACTTCTACTACTCTTCCT^^^ 

TACTTCTAGTACTCTTCCTGATATTACTAGTATTATTCGCACCAGACC^ 

TacTTCtacnacrcrrccTcaTA 

t£™c^ 

TACTCCTAATTCTCTTCCTTArACTACTAGTATTATTTGCACCAG 

TATTACTAATCTTAGTCTTAATAACACTAGTACTATTCACAw "^^"^J^^L^q^^q^c ^ 
TA^ACTAA^CTTAGTCTTAATAACACTAGTACTArraAC\CCTG«rrT rt ^GG«G^C : 

TATTACTAATCCTAGTCXTAATAACACTAGTACTATTCAC^ 

TAATACTAATCTTAGTCTTA\rAGTACTAGTACTATTCA 

TATTACTAATCTTAATCTTAATAGCACTAGTGCTATTTAC-^ 

TAGCCCTATTTCTJWKTCCW^^ 

TAGTCCTAATACTAGTCCTTCrATTACrAGTCCTATrTTCACCG 
TACTACTAA^CTAACCCTACTAGCACTAACCCTATtCGCACCGG 

-ACTACTAATTCTAACCCTACTAAC^^^^ 

TACT ACT AATCCTAACCCTACTAATACTAACCCTATT^GC^ 

TACTACTAATCCTAACCCTACTAATGCTAAC^^^ 

TGCTACTAATCCTAACCCTACTAATA^^ 

TACT ACT AATCCTAACCCTACTAATATTAACTCT.-*.- - ^-^"-—gC^GGAGACC 

TATTACTAATCCTAACCCTACTAATCTT^ 

TATTACTAATCCTAATCCTACTAATA^ 

TACTACTAATCCTCACCCTACTA^ 

— - — ~a -xr^^TACTAGCATTAACCCTA- j. - — — * 
TATTCCTAATCCTAACCCTAC~AGCACTAACCCTArTC^C^-^w«- - i - CGAGACC 

TATTCCTAATTCrAACCCTACTAGCACTAACCCTATTCACC-^^rt^>-^-^^^^^^^^ 
TATTCCTAATTCTAGCCCTACTAGCACTAACCCTATTCGCCCCiGr\CC^rtCj^Gi^AGACv. 

TATTCCTAATTCTAATCCTACTAGCACT^ 

TATTCCTAATTCTAGCCCTACTAGCACTAACCCTATTCACCCC "-^;^7^t^ C ^ 
TA"^CCTAATCCTAACCCTACTAGCACTAACCCTGTTCACCCCTG 

TATTCCTAATTCTAGCCCTACTAACACTAACCTTATTCACCCCCj<^^C^«>^~^^^^^^ 
TATTCTTAATTCTGGCCCTACTAGCACTAACCCTATTCACCCC — 
TATTCTTAArrCTGACCCTACTAGCACTAACCCTA— AC-C^ r: -^- Z ^ GAc ^ cc 
TACTCTTAATCCTAGCACTACTAACACTAACCCTATT.-AC---^----^— ^ ccacacc 
TACTCTTAATCCTACCACTACTAACACTAACrCTA. . - - ~Z.^~ZZZ . r-ACCAGACC 
TACTCTTAATTCTAACATTACTAACACTAACC^ 

TACTCTTAATCTTAGCA. .Al ..-^Ov_.~- ~- • -~~ZZZ~, A — aGGACACC 

TACTGCTAATCCTAACACTACTAACACTAACrC . A . . —j^ZI-l^-AGCACACC 
TACTCTTAATTCTAACACTACTAACAC -AACC . .A~. rt u - " ^ ^T^-- , .~~ aC GAGACC 

TACTGTTAATCCTAACT r I^ ' --AArA^^^"^™^""'" -'.'-••--•-'^'^a'^'^TACTAGGACACC 
TACTCTTAATCTTAGTC'ITA^TAACA'. . AAC- n 
TACTCCTAATCTT.^ACCCTACTACCACTGACCCTA— CAC7CCAGAC*. .ACTAGGAGACC
TACTCCTAATCTTAACCTTACTAGCACTCACCCTATTCACTCCCGACCTACTACGAGACC 
TACrCCTAATCTTAACCCTACTAGCACTAACCCTATTCACTCCCGACCTACTAGGACACC 
TACTCCTAATCTTAACCCTACTACCACTAACCCTArr CACCCCCGACCTAu .AGGAGACC 
TACTCCTAATCTTAACCCTACTAGCACTAACCCTATT CACCCCCGAC > ACTAGGAGACC 
TACTCCTAATCTTAACCGTACTACCACTAACCCTATTCAC2CGGGACCTACTAGGAAACC 
rACTCCT^rC"^CCCTACTAGCACT.^CCCTAr?CACCCCCGACCTA~AGCAGACC 
TACTCCTAATCTTAACCCTACTAGCACTAACCCTATTCACCCCCGACCTAGTAGCAGACC 
TACTCTTAATCTTAAC CCTACTACCACTAACCCT7ATTCAC C C CTCACTTACTAGG AGACC 
TACTCTTAATCTTAACCTTACTAGCATTAACCCTATTCGCCCCCGACCTACTAGGAGACC 
T ACTTTT AAT C CT AAC CTT ACT AG CACT AA C C CTA TTTA C C C C C C A C CTACT AGGAG AT C 
TACTTTTAATCCTAACTTTACTAGCACTAACCCTATTCACCCCCGACCTACTAGGAGACC 
^ACTCCTAATCCTGACCCTACTAGCACTAACCCTATTCACCCCCGACCTACTAGGAGATC 
TACTACTAATCCTAACCCTATTAACAGTAACCCTATTCACAC ^G ACCTCCTAGG AG ACC 



TCAT C CT AAT CCTAAC _ 

^ ACT ACTAATCT^AGCACTACTCACACTGACCCTGTTCGCAC CTG ATCTACTAGGAGACC 
^ ACT ACTAATCCTATC C CTACTTACACTAAC CCTGTTCGCACC CGAC CTGCTAGG AG AT C 
TTCT ATT AATATTTGTT CTACT CACACTAACCTTACTTGCA.C CAGACCTACTCGG AG AT C 
TACTATTTATTCTAACTTTACT^ 

T ACTACTAATCCTAGC C CTACTCACGCTAACCCTATTTGCACC CGAjCCTACTAGG AGAGC 
TACTATTAATCGTAATTCTACTCGCACTAACCCTATTCGCACCCGACCTGCTAGGAGACC 
^ ACT ACT AAT~CTAGCC CTACTCACCCTAACC CTATT CGCAC C CGAC CTGCTAGGAGAC C 
T AC 17 ACTAATT CTAAC C CTACT C GCACTAACCCTATT CGCA.C CTGAC CTGCTAGG AGAC C 
TACTTATACTAACC CTATTTGCAC CTG ACCTACTAGGA.GA.CC 



TACTATT AATTCT GGC 
TATTACTAATCCTAG 



itactcacactaaccctatt 



TATTATTAATTATAGTCCTACTTATACTAACCCTA 
TATTAATAATCCTAACAATACTCACGCT"" ' — 
TACTTC 




T C CTAGGGGAC C 



TACTCCTAATAACAACACTACTCACACT 

TACTTCTAATC CTAGCCCTACTCAC C CTAGTTCTATTCT GGC CTG AC CTGCTAGG AG AC ~ 
"^GC^TCT AATTATAGTATTACTCACCCTAGTCCTATTCTCCCCTG ACATCCTAGGvjvjA*- 
TACTCCTAAT CCTAGCACT ACT CGCC CTAGTTCTATTCT CAC CAGACAT C CT AGGAG AC C 

Z CTAGTATTA TT CT C C C CTGAC CTC CTAGG AG ACC 



TACTCATAATTATAG CTCTTCTAATC CT AGTACT ATT CTC 
TATTCATAATACT AATCCTG^AATCCTAGTATT ~ - * c ~ Z : 



rTATAATACTAATC CTACT AAT C CTT GT ACT ATT CTCAC ZS- ^ 
rTTCT ACTCGT ACT ATT CTCAC GAG AC CT- 



TACTACTTATTCTG ACCCTACTTCTACTCGTACTA , . CTC 
TACTACTTATTCT^ATTCTACTCCTACTCGTACTA I - CT' 
TGCTACTAATATTAATTCTCCTTATT' 



:CTACTGC 

TC CTACT AATCTTG ATT CT AAT ATTACTAGTAATAT 
TCTTACTAATCTTAATTCTAATATTACTAGTAATATTT 
kTCCTAATACTACTAGTAATATTT 
I CT AAT ACT ACT ACT AAT A 



iTCTCCTCCCAGACC 



'CACCAGATCTGCT AGGAG ACC 

—ACTA ^^TTAACCCTAATACTACTAv.-VAA . A _ . -'CACCCCACCTGCTCGGACACC 
T^~CTAATCCTAATCCTAATACTAC^ 

TACTT ~~~~ ~ ~ ~ * ~ " " * " — ■ - — " 



TACTCCTCATCCTGGTCCTTATACTACTAGTACTG - . C 

TACTCCTCATCCTACTTCTAACACTACTAGT - - - - - 

~A'~^CTTATTCTAATCCTAATACTACTAGTATTATT™ 

"-~AACCCTAATATT ACT ACT ATT ATT TT TACT - -~ — - * ' 

' rACTGTTATTTTTACCCGACTTATT AGGAG ACT 

AG—- r-i ~ — ACCCGACCTACTGGGACAT~ 




:ct' 

TACTCCT' 

TACTTCTTATTTTAACCCTAATA^ wV: 

tacttctaatcctaotttttata: 
CTCTACTCCTA . ^
CATC ACATT ACT ACT ATTCTTACCTCAu * *GC. .CCTCATC 
TAACACTCATACTACTCCTCCTATTTTCACCACACCTCCTACCACACC 



T A CTC CTT AC 
TACTACTACT 

TACTCCTACTCCTAATCCTAATATCACTAGTTTTAT 
TAATCCTAATTATACCTCTATCATCACTACTATTA . 
T AATCCTCATCTTAAT CTTCATAAC C ' 
TAATCCTTATCTTAA 

TTA 



;CTATTACCACACC 
TTCACCTCACCTACTACCACACC 
'U. l - . CACCCCTGATC^ - 
CCCGGA.TC . - C" 



\ATAAACTTACTA CTATTTTCCCCCCATC T ~ » T ACCACAC C 
'A T AAACTT AG TACT ATT CT C C C C C C AT ~ J^^*^^ ^ ^ ~ 

^CCCTCCTATTCATAATCTTA^ 
C^ATTCTCTTACTAACTCTGTTCTCCCTA 
T CTTT CTT CT ACTAATCCT ACT CAC CCTAGT C CT ATT CT C C C TAG AC ^A^AGo^w A^ ~ 
-C™C^^CTACTATGCCTATTCTCTCTAGTACTATTTrCCCCCGATC^ , ,~G^~^ 
TCCTC—ACTATTATCCCTATTCTCTCTAGTACTATTCTCCCCTCACCTCCTGGCAGACC 



rCTAACCCTAT 



'CCCC 

: :- -cCTATTCTCCCCCGACCTTCTAGCACAC: 
TCTAGTTCTATTCTCCCCTGACCTTTTAGCAGACC 




TTTTCC 
TAATCATAT 
TC 



TGA C CTTGA.TAACATTAACACTATT CT CAC GAG ACCT C GTAGG AGACC 
CTCTT C CCTCTCTC CTTAATG ACATTAACACCATTCT CAC CAGAC CTCCT AGGCGACC 
CCTTACTCCTACTAACC; 



CTCTTATAT 
CACTCATGCTCAC 
C ACTCATGCT CAC G 



TTATCCTAATTTTACTCCTTCTACTCTTAGCCCTACTATC 
CACT CATGCT CATTCCATTCCTGAC ACT AGCCCT ACTCTC C 
7TATCC CATTC CTAACACTAGC C CTA ITC 
ZC C CTCCTCACACTAG CAT 
CCT 



ZT CTTAGGTG AT C 
rCTCCTAGGTGACC 



CACCAAACCTACTAGGTGACC 




CACT CATACT CACTC CT CTC CTCA CA CTAGC CTTA. 
CACTCA.TACTCACTC CC CTCCTCACACTAG CCTTAT 
GACTCAT ACTCGCTC CATTC CTTACACTAAC CC 



" CAT*** C CT AA CACT AG C CCTATTCTCACCAAATC TTCZ Guuw^ - 

, - - . ^ — r ^,— — <~~ - AC — AAC CTT CTGCG CCACC 

CACTTATACTCACCCCA- i .*u*w*u . - - — ^ ^ 

CACTTATACTCACCCCA TTCCTCACACTAGC CCT ATT CT CAC „ ^u-v^ v*v- >m*v- ~ 

C C CTTATATTCAT C C CATTCCTGACACTAGC C " *^ ~ ^ CAAC CT CCTAGG CGAC C 

rCCC'CAACCTCCTAGGAGATC 



A CTAGC AGACC 
.CT AGG AG AC C 




TACTCATACTACTC CCACTCATAAC 
CACTCATACTACTTC 

CACTCATACTACTTCCACTC AT AAC C CTAGC C CT ATT 
CACTCATACTACTTC 
TACTCATACTA 
TACTCATACTA' 



TACTCATACTACTTC 
TACTCATACT 



TAAC C CT AG C C CT ATT 
iTCCACTCATAACCTTAGCCCTATTCTCAC 
C CACTCATAAC C CC ACCT CTA TTTTCA 

CTATT 





tactcatactacttccacttataaccccagct* 
cactcatattacttccactca.aaccct; 
tact cat attactt c cact cat aac c ct ag ctc 7a. . tt ' 
ctcataaccctagctcta: 
c ct ag ct ct ac 
cct agccct ac 
—ccta: 



TATTCATATTAC . . - 
TATTTATATTACTTC 
CACT TATA 

IT CT'CA T A TTT A T C 
CACTAATA CTCCT CCT CCT AA CC . 




ACT AGG AGACC 
ACT AGG AGACC 
ACT AGG AGACC 
ACTAGGAGACC 
AC 



ACTAGGAGACC 
ACTAGGAGACC 



< rt,C v C^A . i * «• #"*C . - 

T A CTC AT AT AC CCCC 
TACT CAT A TACCTCCCCCTAA 



CCT AGG GG AC ^ 
ACTAGGAGACC 
ACTAGGAGACC 
ACTAGGAGACC 
CTAGC AGACC 
TGCrCATACTCCTGCCCCTAATAGCCCTACCCCTATTCACCCCAAACCTCCTAGGAGACC 360
TCCTCATGCTCACCCCCCTAATAGCACTAGCCCTArrCTCACCAAACCTCCTAGGAGACC 360
CAATCATACTAACACCACTAATAACCCTACCCATATTCTCTCCTAACCTCCTAGCAGACC 360
TACTAATATTCGCACTCCrAGCTTCCATAGCCCTATTCTCCCCAAACATACTAGGACATC 360
TTCTAATACTAACCCTCCTACTAACCCTAACACTATTCTCTCCAAACCTTTTAGGGGACC 360
TCCTAATACTAGCCTTCCTGCTAACCCTAACACTATTCTCTCCTAACCTTCTAGGACACC 360

TTTTAATACT^CTTTCCTCCTAACCra 360

TCATTATACTGTCTGTTCTACTACCCCTCGCCCTTTTCTCACCAAACGTTCTAGGCGACC 360
CAGACAANNACATCCCCGCAAACCCACTCAACACCCCTCCCCACATCAAGCCCGAATGGT
GA.GATAACTACACCCCAGCAAACCCACTCAACACTCCCCCTCACATTAAACCAGAATGGT 
CAGACAATTATACCCCAGCA^TCCACTTAGCACGCCCCCTCACATCAAACCTGAATGAT 
CAGATAArTATAC^CCAGCAAATCCACTTAACACACCCCCrCACATCAAACCCGAATGAT 
CAGACAACTATGCCCCAGCAAACCCACTCAACACGGCCCCTCACATTAAACCCGAATGAT 

CAG ACAACTACACCCCCGCGAACC cacttaacacac c c c ct cacatcaagc ccgaatg at 
CAGACAACTACACCCCCGCGAACCCACTT^^ 
CAGACAACTACACCCCCGCV^CCCACTTAAT 
CAGACAACTACACCCCTGCAAACCCACT^^ 

GAGACAACTACACCCCCGCAAATCCACTCAACACACCCCCTCACATCAAGCCCGAATGAT 
CAGACAACTATACCCCAGCAAATCCACTCAACACACCCCCTCATATTAAACCTGAATGAX 

CAGACAACTACACCCCAGCAAACCCACTC^^ 

CAGACAACTATACCCC^GCAAACCCACTCAATACACCCCCTCACATTAAACCTGAATGAT 
CAGACAACTATACCCCAGCAAATCCACTCAACACACCCCCTCACATTAAACCTGAATGAI 
CAiGATAACTATATCCCAGCAAATCCACTCAATACACCCCCTCATATCAAACCTGAGTGv>T 

CAGATAACTATACCCCAGCG^CCC^^ 

CAGATAATTACACCCCAGCGAACCCACTCAACACACCCCCTCACATTAAACCCGAGTGAT 
CAGACAACTATAC C C CAGCAAAC C CACTCAGCACAC C C C CT CACATTAAAC Ct GAATG AT 
CAG ATAATTATACC C CAGCAAATC CACT CAACACAC C CCCT CACATCAAAC CTGAATGA^ 
CAGAC^TTATACCCCAGCAAACCCCCTCAACACAC^^ 
CAGACAACTACACCCCAGO\AACCCACrr^^ 

CAGACAACTACACCCC\GCAAACCCACTTAACACTCCCCCTCACATCAAACCTGAATGA^- 
C^GAC^CTACACTCCAGCAAACCCACT 

CAGACAACTATACCCCAGCAAACCCACTCAACACAC C C CCT CACATTAAAC CAG AGT^A-^ 
CAGACAACTACACTCCAGCAAACCCGCTAAATACACCTCCCCATATC^ 

GAGATAACTACACCCCAGCAAACCC^CTC^CACACCTCCCC^TA 
CAGATAATTATACTCCAGCAAACCCACTTAACACACCTCCCCACATCAAGC CCGAATG 

CAGATAACTACACCCCAGCAAATCCACTTAACACA.CCTCCCCACATCAAAC 

CAGACAACTACACCCC^CCAAATCCACTCAAC^ 

CAGACAACTACAC C CCAGCAAACCCACTTAGCACAC CTC CC CAT ATT AAGCCCGAATGGT 
C^GACAACTAC^CCCC^GCAAACCCACTC^ 
CAGACAACTACACCCCACCAAACCC^CrCAACA^ 
CCCAC^CTACACCCCAGCGAACCCCCTCAATACACCT^ 
CCGACAACTACACCCCAGCAAACCCACTCAACACACCACCTCATAT^ 
CTGACAACTATGCCCCACCAAACCCACTTAACACGCCCCTCACAA . * AAACCTGAATG AT 
CTGACAATTATACCCCAGCAAACCCACTTAATACACCTCCCCATATTAAAC CCGAATG AT 
CCGACAATTATACTCCAGCAAATCCACTGAACACACCCCCTCATATTAAACCCG 

CCCACAATTACACCCCACCAAACCCACTCAACA^ 
CAGAC^CrACACCCCCCCAAACCCTCTTA^ 

CTGACAATTACACCCCTGCAAACCCGCTCAACACACCCCCTCATATCAAACCC^A 

CAGACAACTACACACCAGCAAATCC^CTC^^ 
CACACAACTATACACCACCAAACCCACTAAATAC^CCC^ 
CAGACAACTATACAOCAGCAAACCCACTTAATACACCCCCACATATCAAG 
CAGACAACTACACRCCAGCAAACCCACTTAACACACCCCCACATATTAAAC^^GA 

CACACAACTACACACCAGCAAATCCCCTTAACACCCC^^ 
CACACAACTATAC^CCAGOVAACCOVCrC^ 
CACACAACTATACACCACCAAATCCACT 
CCCACAACTAC^CACCACCTAACCCACTC^ 
CTGATAACTACACCCCAGCAAACCCACTAACCACCCCTGCACACATCA^CC^^ ::;:: 4-0
1 age . a lb CCCATAACrATACCCCAGCAAATCCACTAACCACTCCTGCACACA CA^^AoAA 0 

soca. f lu CCGACAACTATACTCCAGCAAATCCACTTAACACCCCTGCACACATC^C^G^ :::: .-0 

del . leu CAGACAATTACACCCCAGCAAACCCACTAAACACCGCCGCACACA.CA^^Ao^.^ . ,-0 

CTGACAATTATACCCCAGCAAACCCACTAAGCACCCCTGCACACA .CAAACw*^.^ . 4 - 3 
CCGATAACTACACCGCAGCAAACGGGCTTAATACCCCAGCACATATCAAACCAuAG^« : 4 ; 0 
CCGATAACTACACCCCAGCAAACCCGCTTAA~ACCCCAGCACATA^A.AAC^uAG^^^ , Q 
CTGACAACTACACGGCAGCAAACCCACTAACCACCCCGGCACACA- ^CCAG~w«^ 4-0 
s . n C CGACAACTATACCCCAGCAAACGCACTAAGCACCCGCGCACACAi. i «A«C^AG«A i v J A. ,-0 

' ca- CTGACAACTACACCCC\CCAAATCCACT.AAATACCCCAACACACATCAAACCAGAATGGr 4 2 0 

rVfo ' v-c c-GATAATTATACCCCAGCAAACCCACTAAACAC7CCCGCACACATCAAACCAGAATGA~ 4 2 0 

C-CG AT AAC7 ACATTC CAG C AAAC C CACTAAGCACC C CAG CACACATTAAAC CAG AATG AT 420 
CCGACAACTATACCCCGGCAAACCCGCrCAGCACCCCAACACATATTAAGGCAGAATGAT 4 20 
CCGATAACTATACCCCAGCAAATCCACTCAGCACCCCAGCACACATTAAGCCAGAATGA, 4 - 0 

CCOACAATTACACCCCAGC^ J;? 
C CG ATAATTATACT C CAGCAAACCCACTCAACACTCCAGCACACATC2AAAC CAG AGTvjv^i 4 0 
pla" -siia
■ccgi .ire 



choc . s i.n 
bera .bai 
ziph . car 
me so . eur 
mesc - bid 

mesa den CCGAT^TTATACTCCAGc;;^^ 

meso~ : per CTGAGAATIA^ ' ~ ' 

pcnc .bla 
cagacaactatatcccagcaaaccccatgaataccc If

k< =3 . am= CAGACAACTA^CCCCCGC^CCCCCTTAG^ 4 0 

ctgacaactacacccctgccaatcctct^ ; : J 

e^u cagacaactacaccccagctaa^^ J; 5 

-*lc 14 cagacaactataccccagcaaacccattaaacac^^ 

' ;« . ewb3 CAGACAACTACACCCCAGCAAACCCACTAAA^^ «0 

^ ela ccgacaactatactcccc-ctaaccccct^^ til 



tania.=la CCGAC^CTATACTt^^^v.-^iw^y^v.^ — --- -- ,, TGAT 4 -> Q 

"ama «I CCGACAACTATACTCCCGCTAACCCCCTGAACAOACCGCCTCATATTA^ 4-0 

;tTvic CCGAC2AACTATACCCCCGCTAACCCCcrTTAACACACCACCCCA^ -J 



1 1 



CCGACAACTATAC=^<-V-W<- ..acv-wuv— i. i.^w*w»v.w,. — -- — 

CAGACAACTACACCCCAGCCAACCCCCTCAGCACTCCACCACATAT— - — 
CAGAGAACTACArCCCAGCCAACCCCCTCAGTACTCCACCACATA-C 

eu „ jub CACACAACTACATCCCAGCCAACCCCCTCAGCXCTCCACCACArA-^ 

S£ a CAGACAACTATATTCCAGCCAACCCCCTCAGCACTCGACCACATA^iAAACCT 

~Ho ^ CCCACAACTACATCCCTC'cCAATCCCCTAAGTACCCCACCACA^ * - 0 

J: I ccgacaactatatccctgccaatcccctaagcaccccaccacata^a^c : tgaa : cg ; 4 - 0 

~; 3 ' c^i CGGACAACTATACCCCTGCCAACCCCCTAAGTACCCCACGACATA. .^- : ^G^- : A. 4 

' CCCACAACTATATTCCTCCTAACCGCCTAAGCACCCCACCACATA CA^C. J -0 

Vlt ' w:l cccacaactatactcccgctaatcccctaagt^ctccac^catajcaa^c : - : ^- J -J 

M; ccgacaactacacccctgccaatcccctaagcaccccaccacata ^^::X^:^: lf a 

,c, . scb CTGACAACTACATCCCTGCCAACCCCJTAAACACT^C Z CgS-^cC«AArCGT 4 20 

v -U. Mi CTGACAACrACATCCCCGCAAATCCATTGAGCAC.Cv-AC^-^CA.^^C- : a _ ^ q 

: ; L C .V CTGATAACTATACCCCCCCAAACGCACTGACCACCCOXCCC^CA^AAA^--- ^ q 

CTCATAACTATA— CCCGCTAACCCATTAAGCACACCACGCGA.A^AAAC.^GAG ^ = q 

CACACAACTACATCCGAGCCAACCCTTTAAATACCCCTCCCGA.^^^ — - ' - 4 20 
CAGATAACTACACCCCTG-GAAACGCGGTAAACACGCGTGCACATA ^AAAC- .CmC* 
CAGACAATTACATCCCGGCAAACG2GCTAAACACACO\CGCGATA -^C^-^^ 
- CAGAC^CTA^ACCrCAGCGAACCCACTGAACACGCCTCCCCACATGAAAC-ACAA^Ai 

3 1- CACA^CTA.AL-LLA rr - --2 — a~CAAGCCAGAGTGA v 4-0 

r a v,[ CAGACAACTATAGTCCAG>-CAr\CGGA< AAC^uC ^.A^-a^ .-r;AT 4=0 

.Jo .' ph.. C-rGACAACTACACCCCCGCCAACGGACTrAACACGTGTGGTGATATT AAAV-^C-A . >A 
ACTTCCTCTTNGCATACGCAATCCTACCATCAATCCCCAATAAACTACCAGG 472
ATTT^CTATT^CATATGCAATCCTACGATCAATCCCCAATA^CT^A^ 4 . : 

ATTT^CTATTTGCATATGCGATCTTACGATCAATGGCCAAC^ J72 

ArrrrrrATTCGGGTA^v-AA. -..^.^ 

A c _ A7TTGCATATGGAA7CCTACGATCAATGCCTAACA~^ i A«w«Gw 4 < - 

A :i:_-^l Arrr -CA7ACGGAATCCTACGATCAATCCCTAACAAACTAGGAGw 4 

A"""^"— ^GTA — rTGCATACGGAATGCTACGATGAATCGGCAATAAACTAGGAGG 4 , - 

A^CCTATTCGCATACGCAATCCTACGTTCGATCGCCAACGACCTAGGAGG 4 72 

AC-^TCC^A-rrCGATATGCAATCCTACGArCAATCCCCAACGCACTAGGAGG 4 i 2 

ACTTCCTATTTGCATACGCAATGCTACGATGAATCCCTAATAAACTGGGAG^ 4 7 2 

ACTTCCTATTTGCATACGGAATCCTACGATGAAT/rGCCAACAAGCTAGGAGG 4 7 2 

A""TTCCTATTTGCATACGCAATGCTACGATCAATTCCCAACAAACTAGGGGG 4 7 2 

A"*"TTT' — "A"" """'GCATACGCGATCCTACGATCAATTCCCAACAAACTAGGAGG 4 7 2 

AC^TC'~T ACTTGCATACGCAATCCTACG ATCAATC C CCAAGAAACTAGG AGG 4 72 

^^ G ^ GC . xr ATGCGATGC~ACGAr.GAArTCCCAACAAACTTGGAGG 472 

A-r4— cVtATTTGCAT ATGC AATT CTAC GAT CAATCCC CAACAAACTTGGAGG 4 72 
A ^ CCTA ^ GC ATATGCAATCTTACGATCAATCCCCAATAAACrAGGv.Gw 4^2 
J;-^:^ GCATACG CAATCTrACGATC-AATCCCC^C^CTAGGAGG 472 

aSSgtatttgcatacggaat^^^^ ^ 

A ^ cc - A ^ GC ATATGCAATCrTACGArCAATCCCrAATAAACrAC^AG« 472 
aCTT^C^A^TTGCATACGGAATCCTACGATGAATCCCCAACAAACTAGGv.jv» * / - 
? c ^ cc 3t^G^TACGGAArGCTACGATCAATTGCTAACAAACTAGGGGG 472 
ACTT^C^^ATrTGGATACGCAATCTTACGAT GAATG C C CAATAAACTAGGAGG 472 

A^C^ATTTGCATACGCAATGCTAC^^ 4,2 
A ^CC"ATTCGGGTACGCAATTCrACGATCAAT7CCAAATAAA : ^Gv^G. 4 / . 
• ACT^CTTATTTGGATANGCAATTTi ACGGTCAATG £ C GAACAAACTAGGrtGvj 472 
^^^C^^CCCAATrrTACGAGCAAGCGCCAATAAACTAGGAGG 4 y 2 

aSgSg^cgcStacgg^ «" 

ACTTCCTATTCCSCATACGCAATCTTACGATCAGTTCCTAACAAACTMGAGw 472 
Aci^CCTATTCG CAT ACGCAAT CTTACGATGAATTG GTAACAAACTAGGAjvj 4,2 
^r^C^CGCATATGCAATGCTACGATCTATCGGC^CAAGCrAGGAG^ 4 ,2 
^-^: C Z^ TCGCATATGC AATGCTAGGATC^TGGGTAATAAACTAGGAGG 4 - 2 
A' ^^^ATTCGCATATGCAATTCTACGATGAATCGCCAACAAACTAGGAGG 4,2 
^C^t^CGCATACGCAATTTTACGGTCVVTTCGTAATAAACrGGGAGG 4.2 
ACTTG^ATTTGCATATGGAATCCTACGATGAATCGGGAATAAACT 4-2 
ACTTGTTATTNGGATACGCAATGGTACGATG.^TGGGOAATA^CT^A^ 4- ; 
ACTTTTTATTCGCATACGCAATGCTACGATCAATCCGCAATAAACTACGAGG 4 , . 
A^TCCTATTTGGATATGGGATTCTACGATCAATTGGCAACAAACTAGGAGG 4 , 2 
A^C^TTTGC^TACGCAATTGTGGGATGAATTGCTAATAAACTAGGAGG 472 
^G^TTCGCAT ACGGAATTCTGGG ATCG ATTGGCAACAAACTAGGACG 4 - 2 
A ^3^ A ^ CC ATACGCAATCCTGGCATCAArrGGTAACAAACTACGACG 4,2 
AC~T^GTAT^CGCATACGCAATGCTGGGATGAATTGGTAATAAACTAG«jAGG 4 ^ 

at4tcgtattggcatatggaatggtcggatcaatgggtaac^ctagcc^ 4 , 

A r-r —^ATTCGGATATGCAATTGTGGGGTGAATTGGCAATAAA. -ACw^CC « . . 

. — "rr^.-^"" — ~-- G AATAAACTAGGAw*-, 4 . . 

AGTTCTTATTGGCA .A .-,Cw . • ~ ~V . , rr ACC 4 7^ 

_., > — , — - -^ A -~ AA ---CTAACAAACTrtGGA^*-> •* - 

ATTTGGTA . „ — CATAGC^^^^A - ^^^a CCAGG 4 7 2 

ATTT C S*!3Z^*^l l TU^^^^-:Z;-tATAAACTACC<^ 472 

-™gtt Ayr : -^ A ;:-rrr:;J_:::?ixr^:::^ AA cAAACTAccACG 4 7 2 

ATTTCCTATTTSCA^CG^^. ,C*^^. ~ CAAACTAGGAGG 4^2 

ATTTGGTATTTGGATAGGCAATGGTAC^ 

ATTTGGTArTTGCA --=£A ^ ~~ A ^Cl^U - CAAACTACCAGC 4 7 2 

ACTTGGTATTTGCA.AC^CAA AC^«. _AA 7777, AOCACG 4 72 

. , , -'i-ACGCAA— — A--GA-GAAT-7GGGAACAAACTAGCAt/t, 

ATTTGGTA7TTGGATAGGGAATGTTACGATGAATTTGGAA.^AACTACGAs J G 




csc . nip . pul 
cer .nip. nip 
cer.ela . sco 
cer . dam 
ran . tar 
mos . fas 
mos . leu 
mcs . chr 
mos .her 
mcs . mcs 
era. jav 
c rag. nap 
bala.acu 
bala .ben 
bala.bor 
bala.edi 
esch.rob 
bala .mus 
mega . ncv 
bala. pay 
cap .mar 
csph.ccm 
cepfc.eut 
lace. obi 
cepn.hea 
ceph.hec 
lage.aus 
lage.cru 
lace .obs 
lisso.bor 
lisso.per 
gic .mac 
glo . mel 
fere. act 
pepo .ele 
gram. gri 
pse . era 
lace .acu 
crci .bre 
crca .bre 
del .cap 
del . tiro 
del. del 
s ten. ciy 
sten . coe 
cur . adu 
seen. fro 
saus . chi 
s cer* - ion 
curs.tru 
lage - alb 
seen . bre 
scca . i iu 
del . leu 
mono . rnon 
p lac . gan 

kog L . bre 






A -i cCTATTTCCATACGCAATCCTACGATCAATTCCCAACW^.MooMo^ 

A lU. CTAr ^ C CATACGCAATCCTACGATCAATTCCC^CAAACTAGGACG 

A^CCTA^GCA^ 

aSSa^cgcatacS 

A^CTATTTGCATATGC^^ 

AC^C^A'^TG CATATG C CT AC GAT C AATT C C CAACAAACTAGG AGG 
3 rr^^A"T^GCATATGCCATTCTACGATCAATTCCTAATAAACTAGGAGG 

A ^^C^A TTTG CATACGC AATT CTT CGGT CAAT C C C CAAT AAA CT AGG AGG 
A^TTCCTATTCGCATACGCAATCCTACGATCAATCCCCAATAAATTAGGAGG 

ACTTC CTATT CGCATACGCAA7CCTACGATCAATC CCTAATAAACTAGGCGG 
ATT~ 'CTA^rCGCATACGCAATCCTACGATCAATCCCCAATAAACTAGGCGG 
A--TTCCTA-TTGCATACGCAATCCTACGATCAATCCCCAACAAATTAGGCGG 
A^CCTA^TTGCATACGCAATCCTACGATCAATTCCCAAC^AATTAGGCGG 
ATTTC'^T V*TTG CATACGCAAT C CTACG AT CG ATCCCCAACAAATTAGGCGG 
jv*"t^CCTA~"'^GCATATGCAATCCTACG ATCAAT C CCCAACAAATTAGGCGG 
A^C^A^GCATACGCAATCCTACGATCAATCCCCAACAAACTAGGCGG 

A^TTTCTATT CGCATACGCAAT C CTACG AT CAATCCCCAACAAACTAGGCGG 
ACTTCCTATTTGCATATGCAATCCTAC GATCAATTC CTAATAAATTAGGTGG 
ACTTCCTATTCGCATATGCAATCCTACGATCAATTCCCAATAAACTTGwAGo 

A^CTATTCGCATATGCAATCCTACGArCVATTC^A^^^ 
A^CCT^CGCATATGeAATCCTACC-ATC^ATTCCTAATAAACTC^AGo 
A^C^CGCATATGCAATCCTACGATa^rTCCTAAT^C-CGGAGG 

ACTTCCTATTTGCATACGCAATCCTACGATCAATTCCTAATAAaCTTG«AGo 
ACTTTCTATTCGCATACGOlATCCTACGATCAATTCCTAATAAACTTGv^Gvj 

ATTTCCTA^CGCATATCCAATCTTACG 

A^cSc^ 
ATTTCCTATTCGCATATGCAATC^^ 

ArrrCCTATTCGa^TATGCAATCTTGCGArCAArrCCCA«CAAAC^G«A^ 

ATTTCCTATTCGCATATGCAATCC -ACG~- CAA- - * r ^, Ga 

ACTTCCTATTCGCATACGCGATCCTACGATCAATTCCTAA^AAACTCGGw^ 
ACrrrcrATTCGCATACGCAArCTTACCA-CAATCCCTAA-AA^C^^GG 

ACTXTCTATTCGCATACGCAATCTTACGATCAATCCCTAA.A^^^auv. 
ACT7TCTATTCGCATACG CAAT CTT ACC AT CAAT CCCT 
ATTTCCTATTCGCAT^^ 

ATTCCTATTCGCATACCCAATCTTACGATC^^ 
ACTTICrATreGCATACa^TC^ 

ATTTCCTATTCGCATATGCAATCCTACGATCAATCCC . ^;^^££cG 
ATTTCCTATTCGCATACGCAAT'CTT ACC AT CAAT ^ ^^^^^5^^^^-^^^ AGG 
AT^CTArrCGCATATGCAATCTTACGAT^CAATCC^^^^^^ 

:atttgcatacacaatcctacgatcaa : ' 



act: 



TCCT. 

ATTTCCTATTTGCATACCCAATCCTACCATCAATCC^ 
ATTTCCTATTTCCATAM^ 

A -rrrCC7ArrTGCATACGCAATCTTACGGTCAATC -C ^ * -AA- 



:acgc 

nTGCATACGCAATCTTACGv 
ATTTCCTATTTCCATATGCCATTCTACGATCCA t 



rCCTAACAAACTAGCGGG 
CACACAATTATACTCCACCAAACCCACTCAATACACCCCCTCACATTAAAu^CA^ . .AT 420

CCCACAATTATACCCCACCAAACCCACTCAATACACCCCCTCACATCAACCv-^AA-^ 4 2 0 

CAGACAACTACACCCCAGC7AA7CCACTCAACACACCCCCTCATATTAAGCCTGAA -«^T -V 2 o 

CAGACAACTATACCCCAGCAAATCCACTCAATACACCCCCTCACATTAAACC^AA.^ 4 2 p 

CAGACAACTATACCCCACCAAATCCACTCAACACACCCCCTCACATTAAACC^G^-^^. 4, C 

CAGACAACTATACCCCAGCAAArCCACTCAACACACCCCCTCACATTAAAC^o^A .v~. 4 . O 

CAGACAACTATACCCCAGCAAArCGACTCAACACACCCCCTCACArCAAAu^^---. * - P 

CAGACAACTTATACCCCAGCAAATCCACrCAACACACCCCCTCACATCAAAC. -~ -~ - - : ° 

CAGACAACTACACCCCAGCAAATCCGGTCAACACACCCCCTCACATCAAAcC 4 , 0 

CAGACAACTACACCCCAGGAAATCCGCTCAACACACCCCCTCACATCAAAC^AA . ^ - -. - Q 

CAGACAACTACACCCCAGGAAATCCGCTCAACACACCCCCTCACAtCAA^CCr^^^ . * - J 

CAGATAACTACACCCCAGCAAACCCACTCAACACACCCCCTCArATTAAACCTGAArG^. 4 0- 

CAGACAAATACACTC^GCAAATCCACTCAACACACC^CCTCATATTAAACCCGAATGA, 420 

CAGACAACTATACCCCAGCAAACCCACTCAACACTCCCCCTCATATTAAACCTGAAT^- 4-0 

CGGACAATTATACCCCAGGAAACCCATTAAATACGCCCCCACATATTAAACCCGAATGAT 4 - 0 

CGGACAATTATACCCCAGCAAACCCATTAAATACACCCCCACATA-TAAACCCGAATGA. 4 - 0 

CGGACAATTAtACCCCAGCAAACCCATTAAACACACCACCACATATTAAAC.^ 4 - 0 

CGGACAACTATACTCCAGCAAACCCA— AAATACACCTCCACATA, ^CC^-Gv, , 4 0 

CC3ACAAr»CACT<K^^ 4,0 

CCGA(^CTATACCCCAGC^CCCACTCAGTACCCCAGCAC^CA^^C^G^^ 4 - 0 

CCGACAACTACACCCCAGCAAACCCACTCAGTACCCCAGCACACATTAAACCAGAA^v^^ 4,0 

caaacaAcrAC^ccascaA^^ 7 : 0 

OGACAACXMraCCACrcu^ * Z ? 

CAGACAACTA^CCCCAGCAAACCCACTCAGTACCCC^ ~ 

CAGATAACTACACCCCAGCAAACCCACTCAGTACCCCAGCACACA^AAACv-rtGAG^^^- 4-0 

CXWCAACnCACCCC^^ 4 ; 0 

CTGATAACTATACCCCAGCAAATCCATTAAGCACCCCCGCACACATCAAACC^G^G^G^^ 4-0 

CTGATAACTATACCCCAGCAAATCCATTAAGGACCCCCGCACACATCAAACCAGArtTG^^ 4 0 

CTGATAACTATACCCCAGCAAATCCATTAAGCACCCGGGCACACATCAAACCAG«A*^*^ -0 

CTGATAACTATACCCCAGCAAATCCATTAAGCACCCCCGCAaACATC^C^^:^?; 7 ; 0 

CTGATAACrATACCCCAGCAAATCCATTAAACACCCCCGCACACATCAAACCrtGAA^>^T 4 - 0 

CTGACAACTATACCCC^GCAAATCCATTAAGCACCCGGGCACACATCAAAC^GAA^- - 

CTGACAACTATACCCCAGCAAATCCATTAAGCACCCCCGCACACATCAAACCAG«ATwrt^ 4 , C 

CTGATAACTATACCCCAGCAAATCCATTAAGCACCGCAGCACACATCAAACCAGAAi 4 _ 

CTGATAACTACACCCCAGCAAATCCATTAAGCACCGCTGCACACATCAAACCAGAA^v^^- 4-0 

CTG AT AACT ACAC CC CAG CAAAT C CATT AAG CAC C C CTG CACACA - CAAAC ^^G^A -^-j ^ - 

C-GATAACTATACTCCACCWTCCACTAAGCACCCCTCCACACAT^C^GAA-G^ 4 a 

CTGATAACTATACTCCAGCAAACCCACTAAGCACCCCTGCACACAaCAAAC^ 4 0 

CTGATAACTATACTCCAGCAAACCCACTAAGCACCGCrrGCACACATCAAAG^GAG^G^ 4 - 0 

CTAACAACTATACCCCAGCAAACCCACTAAGCACCCCTGCACACATCAAACCAGAA^^A^. 4 , 0 

CTGATAACTACACTCCAGGAAACCCGCTAAGCACCCCTGCACACATCA^^CCAGAA^Grt^ 4 - 0 

CTGATAACTATArrCCACCAAACCCACTAAACACCCCTGCACACA.C^C^CAA^A. - . 

CTGATAACTACACTCCAGCAAATCCACTAAGCACCCCTGCACACATCAAAC^CAA.-.^- 

CTSACAACTATACCCCAGCAAATCCACTAAGCACCGCTTwCACACATCAAACCAgAA^sjA- •« 
CTGATAACTATACTCCAGCAAATCCACTAAGCACCCrrCCACACATCA^C^CAA-^A- 

CTGATAACrATACCCCACCAAA-CCACTAAGCACCCrrGCACA - A -;^;^7l~,- 
CTGATAACTATACCCCAGCAAATCCACTAAGCACCCGTGCACATA . r^Z^CV < : 
CTGATAACTATACCCCACCAAATCCACTAACCACCCCTGCA(2ACATCAAAC^AA^^ 

CTGACAACTATACCCCAGCAAATGCACTAAGGACCGCTGCACA— A . ^-^^--A 1 4 2 0 
CTGATAACTATACCCCAGCAAATCCACTAAGCACGCCTGCACACA . _A~^C _A«J, • - - 






CTGATAACTATATCCCAGCAAATCCACTAAGTACGCCGGCACACA - ^^^GAG^A. 4 - 0 
C^GAC^TTATACCCCACCAAATCCACTAACCACCCCTGCACACATJA^C^AA^ . - 
CCGATAACTATACCCCACCAA,\TGCACrAAGCACCGGTGCACAGA.CAAAC^AA. ; A ; ^ 
CTGATAACTATACCCC^GCAAATCCACTAAACACCGGTGCACACATCA«AC^VOA^ . - 
baby.bab
phac . af r 
sus . bar 
■ d " ^^^GSTACGCAATCCTACGTTCAArCGCTAACAAACTAGGAGG 472

hype . amp a^^T^^^^GCATATGCAATTTTACGATCAGTTCCTAATAAACTAGGAGG 4 7 2 

"""'STI ^CCTt^GCCTACGCCATCCTACGATCAATTCCCAATAAAGTGGGAGG 4 72 

I" ; t ScS'G^CG^TACGGAATTCTCCGATCAATCCCTAACAAACTGGGAGG 472 

***- llb ^cSG^CGCGTACGCGATrCTCCGATCAATCCGCAACAAACTAGGAGG 472 

hipp.«p AC^CC^A^TCGCCTACGCAATCCTACGATCCATCCCO^TAAACTAGGCGG 472 

diC . SU m A ^:S^GCTTACGCAATCCTACGATCCATCCC.AAACAAACrAGGCGG 472 

rhi-1 . son A IE^^ A l^SXliV^^V^-^. AC GAT-CArCCCTAACAAACTACGCGG 4 7 2 

a ^^gg^a^ 4 I 2 

A ^^^GCciACGGCATCCTACGCTCAATGCCaAACAAATTAGGGGG 472 

can . bac AmCCTACTCSC lt A ^; A ~Vr^^CC^CAA * 7 2 

arc. for M™ 003 ^^^ 472 

arc.gaz A^C^^S;^™^^^ 472 

oco - ros ATTTC S A S?r-icG^^ SSXcS-^ScCCAACAAACTAGGAGG 472 

pfco.fasciata ACT ^ A ^ GC ^ AC . G ^I;:^ 472 

pho.viC ACTrCCTATTTGCCTAC^CAA^CTTACG^.C-A-CCw^ 

cys . cri A CTTCCTA^=C S ATGCAATCCrAG^ 4?2 

hvd . Leo A ^ CCTA ^ GCCTAC ^^ A S^-^-C--^CAAACT^ 4 7 2 

Mr ;! aSSa^c^ ; 

:;:.sch . AcrrccwrrawoG^ 472 

sel . cHi ACT ™ A ^S^^^SaCC^ 472 

ATTTCCTATTCGCATA j. GOaATTCTACwjA — ^XV^ * ^r-r^nrttG 4 72 

ACTrCCTATTCGCATACGC^TTCTCCGATCCArCC™ ^. 
ATTVTCTATTCCCCTATGCTATCC^ACGATCCA^C^AA.AJ^^AC^^^^ 



ail.ful 
f el 
can 
cal 

gla . sab 



G C AT ACG CAATTlTAL ^rV. * u - * - . 

ACTTTCTATTCGGAT ACGCAA^CCT ACG ATCT^^ - — JI^^^^^^^a^cACG 



ACTXTCTATTCGCATA«--UAA . — ~~ J, ~ - AAA .r TA GGAGG 4 7 2 

- - - - - ACTTTCTAA JI; A ; :r^^;™-x A i^-:ccTAACAAACTACGCGG 4 7 2 

? - « • ™= m ATTrCCT A TT C -^-;-^^-;^ A ^ A ;;^ A :;:-^ CAA ACTACCAGG 4 7 2 

ga la . Cemi ATTTCCTATTTGCCTACGCCATCC^ACGA^.^A^^^^C^^^^^ ^ ? , 

p-ro.por. ACTTTCTA7^CGCCTACGCC^TG^^A^C^^--^ £ ^^ CTAGCACa 4 ,, 

gala . ma ~ ACTTCTT ATTTCCTTA '^CATCG^^w-A^^ - - cTAGCACG * 7 2 

gala. mo* ^-^JZI^^^ < 72 

0=0. ga c A ~ CCTA l^^ A ;~rJ^"-AC^ 4 7 2 

lor . Car ATTTCCTATTCCCA . AC~jCAA . . ^TAC^A >, .. 
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PRIMER 'mcb869'



rACGCCATCCTTCGATCAATCCCCAACAAACTAGGAGG 



rCCCCAATAAACTAGGAGG 

;taacaaactaccagg 



ATTTTCTATTCGCC: 

ATTTGCTATTTGCATACGCCATTCTACGC: 
ATTT C CT ATTTGC CT ACG C AATTCTC CG ATCTG1 
ATTTCCTATTCGCCTACACAATTCTCCGATCCGTCCC 

ACTTTCTATTCCGATACGv. . - ~ - ^ ^ - - - - 

ACTTCCTTTTTGCTTACCCCATTCTACGATCTCTACCAAACAAACTACGAG. 

ATTTCTTATTTGCCTArGCCATCCTrGGC~CAATCGCAAACAAACTACGAGw 

ACTTCTTATTTGCGTACGCCATCCTTCGTTCAATCCGCAACAAACTAGGAGG 

ACTTCCTATTCGCTTATGCCATCCTGCGCTCAATCCCAAACAAACTTGGwvjw 

ACTTCCTATTCGCCTACGCCATCCTACGCTGAATCCCAAACAAACTTGGAGG 

ATTTCCTGTTCGCT7ATGCCATCCTACGCTCAA7CCCAAACAAACTCGGAGG 

A^rrrCTGTTCGCT-ArGCCATCCTGCGGTCAArrGCAAACAAACTCGGAGG 

ac-^cctattcgcctatgcc^tcctaccctcaatcccaaacaaactacgac-g 

ACTTCTTATTTGCCiACGCTATCCTACGCTCAATCCCAAATAAACTCGGAGG 

acttcctatttgcctaxgctatcctgggctcaatcccaaataaactcggagg 

ACTTCGTATTrCCCTACGCCATCC-ACGCTC^rCCCAAACAAACTGGCGGG 
ACTTCCTATTCGCG~ATGCTATCGTACGATC\ATCCCCAACAAACTCGwAGvj 
ACTTCCTArrrGCCTACGCCATCCTACGCrCAArCCCCAACAAACTCGGAGG 
ACTrCCTArTrGCCTACGCTA-TCTACGCTCAATCCCCAATAAACTTG^AGw 
A^TTCTTATTTGGGTATGCCATCCTACGTTCAArTCCAAAGAAACTAGGAGw 
^CT-ATTTGCA-ACGCaATCCTACGrrCAATTCCAAACAAACTAGGAGv, 

ACTTTTTATTTGCArACGCCATCCTACGTTCAATCCCAAACAAACTAGG^G^ 
AC^TTTTA' 7 ~^GCATACGCCATCCTACGTrCAATCCCAAACAAACTAGv»AG« 
ACTTCCTAT^GCAXACGCCATCCGACGTTGAArCCCAAAGAAACTAGGAGvj 

ACTT^TTATTTGCCTACGCCATCTTACGCTCAATCCCAAACAAACTAGGAG»j 
ACT^TSATTTGCCTACTCCATCTTACGCTCAArCCCAAACAAACTAGGAGw 

ACTTTTTATTTGCCTACGCCATCTTACGCTCAArCCCAAACAAA 
ACTTTTTATTTGCnTACGCCATCCTACGTTCAArCCCAAAw^AACTAGGAGw 

ACTTCTTATTTGCATACGCTATTCTGCGTrCAArCCO^^CAAACTA « 
ACTTCCTCTTTGCATACGCCArCCTACGCTCCArCCCCAACA^C^A « w 
ATTTCCTATTCGCrTACGCCATCTTACGCTCCATCCCwAACAAACT 
ArTTCCTATTCGCATATGCCATCCTACGCTCAArCCCCAATA^CrAG^G^ 
AC^CCTArTCGCCTACGCCATCCrrACGCTC^rCCCCAACAAACTAGG :;:: 
ACTTCCTATTTGCCTACGCCATCGTACGCTCAATCCCCAACAAACTGG 
ACrTCCrATrTGCTTAGGCCATCCTACGCTCAA^CCC^^CTG^C.G 
ACrrCCTATrrCCCTACGCTATTCTACGC^CAArrCCC^C^^AC.CGG 
ACrTCCTA~CGCCTACGCCATCCTGCGATCAArCCCGAATA^ACTAooAGG 
A^TTTTTATTTGCATACGCTATTCTGCGATCAATTCCAAACAAACTAGGAGG 



ACTTTCTTT 
ACTTTCTTT 
ACTTCCTA 
AC 



•ATTCGCCTACGCCATCCTACGATC: 
CGCTTACGCAAT 



rCCCTACGCAATCCTACCArCAA 
'ACGCAATCCTACGA 



TTGCC 
TCTTATTTCCCTACGCCAT 



rCCCAAACAAACTAGGACG 
:Z CC AAACAAATTAGGTGG 
rCCCAAACAAATTAGGAGG 

— ACG^^^CCAAACAAACTAGGCGG 
CCATCAATCCCTAACAAACTAGGAGC 
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J3 

BLASTN 2.1.2 [Nov-13-2000]

Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.



RID: 984592689-1224-27770

Query=

(328 letters)

Database: Sequences from complete mitochondrial genomes
129 sequences; 3,164,247 total letters

If you have any problems or questions with the results of this search
please refer to the BLAST FAQs



Taxonomy reports

Distribution of 50 Blast Hits on the Query Sequence

[Mouse-over to show defline and scores. Click to show alignments]
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Sequences producing significant alignments:
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Felis catus mitochondrion, complete genome
Phoca vitulina mitochondrion, complete genome
Canis familiaris mitochondrion, complete g...
Halichoerus grypus mitochondrion, complete...
Cavia porcellus complete mitochondrial genome
Ceratotherium simum mitochondrion, complet...
Myoxus glis mitochondrion, complete genome
Equus asinus mitochondrion, complete genome
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Matrix: blastn matrix: 1 -3

Gap Penalties: Existence: 5, Extension: 2

Number of Hits to DB: 788

Number of Sequences: 129 

Number of extensions: 788 

Number of successful extensions: 163 

Number of sequences better than 10.0: 77 
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AFOS3039 727 a C c ^ 

AFOUfll? 727 a C c "' 

AF0.S1037 727 a C c ' * 

AF0530 36 727 a C c **" 

AFOSASJi 727 . a C * • • 7gfi 

AFJ353034 727 a C <= ^ 

aVo53033 727 a.... t <= " 

AFQ53032 727 a C <= 

AF0530U 727 a C C 

AFQS3030 72V a t C ^ 

AF0S3029 727 a C <= 

AFQS3J.23 727 a C C ■ • • 

AF05 3 027 727 a C C 

AF0S3026 727 a C C c mi* 

AFOS 3 ,020 727 a t C c. . . 7« 

AFOS3Q19 727 a C c C. . . 785 

AW53043 727 a C « ; ; C. . 7JJ 

X32301 727 a c <= .-.c... 

AFQ530S2 727 a t C t.C.C t 735 

X3230O 727 a.. C C . C - c . . . c 

A3004233 727 C C a. .eg a 

A3004237 727 t C a..C a ^ 

X3229S 727 r..C C a..C a ^" 15323 

NC 001700 157S4 C C a..c a & ' " 1SS23 

U20753 IS7S4 C c a - c *" * " 630 

AF125145 S97 C * c C.e...... ^ 

AF125144 597 ..t C C - a..c C... ^ 

AF1S497S 727 ..a C C.g a..C c * " a c?a5 

A30S1237 727 ..a c C.g a..c . .c • g _ 

AF12S149 S97 C CC a C.C 

cmpseq_0 301 aatacccccccccawaccaagccrgaac 323 

AY005309 339 355 

AF053Q54 737 . .C 803 

AFOS30S3 737 . .c 803 

AF0S3050 737 t c 81 * 

AF0S3049 737 t C 814 

AF0S3025 737 C C 814 

AFOS3024 737 C c 814 

AFQ53023 737 C C 314 

AF0S3022 787 £ <= 814 

AFOS3021 737 t C 314 

AFO53013 737 C C 314 

AF053051 737 C C 814 

AF053043 787 C C 314 

AF0S3047 737 - <= 814 

AFOS304S 787 t C 814 

AF0S3045 787 C C 814 

AF0S3044 737 C C- 814 

AF053042 737 C C 814 

AFOS304 1 737 C C... 814 

AF05304O 737 C C... 814 

AF053Q39 737 C C... 814 

AFOS3033 737 C <= 314 

AF053037 797 C C 314 

AFOS3035 737 C C 814 

AF0S303S 787 C C... 914 , 

AF053034 797 C C 814 

AF0S3033 737 C C... 814 

AFOS3032 737 C C... 814 

AF0S3O3 I 737 C C...- 814 

AF053030 787 C C . . . . 814 

AFOS302? 737 C c... 814 

AF053023 737 C C... 814 

737 t C . . . . 3 14 

AF05302^ 797 t C... 

AFQSIOZQ 797 C C... 9X4 



.C C . . . 914 



AF7^fjQ43 797 C C- . . . 914 

M;JJ?L 797 C C 914 
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Af9S .?.9S2 787 .gc •» 

win 7a7 -9c • • 

HSJ2i2S0 15824 * * 

SSfllSJ 15824 1535 

AP125144 657 ..C 

*f 154 978 787 ..C..»..a J™ 

aaasi237 787 ..e..a..a ■ 

nF^2S149 657 ..c 



Database: nt

Posted date: Mar 2, 2001 12:20 AM
Number of letters in database: 2,863,827,885
Number of sequences in database: 807,597

Lambda K H

1.37 0.711 1.31

Gapped

Lambda K H

1.37 0.711 1.31



Matrix: blastn matrix: 1 -3

Gap Penalties: Existence: 5, Extension: 2

Number of Hits to DB: 460542

Number of Sequences: 807597

Number of extensions: 460542

Number of successful extensions: "*71

Number of sequences better than 10.0: 6437

length of query: 328

length of database: 2,863,827,885

effective HSP length: 20

effective length of query: 308

effective length of database: 2,847,675,945

effective search space: 877084191060

effective search space used: 877084191060

T: 0

A: 30

X1: 6 (11.9 bits)
X2: 15 (29.7 bits)
S1: 12 (24.3 bits)
S2: 19 (38.2 bits)
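The length figures in the search summary above are related by simple arithmetic, which can be checked with a short sketch. It assumes the convention used by BLAST 2.x summaries of this era: the effective HSP length is subtracted once from the query and once per database sequence, and the effective search space is the product of the two effective lengths. The function name `effective_search_space` is hypothetical; the input numbers are the ones reported in the summary.

```python
def effective_search_space(query_len, db_letters, db_sequences, hsp_len):
    """Return (effective query length, effective database length, search space).

    Assumed convention (legacy BLAST 2.x summaries):
      effective query length    = query_len - hsp_len
      effective database length = db_letters - db_sequences * hsp_len
      effective search space    = product of the two effective lengths
    """
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_sequences * hsp_len
    return eff_query, eff_db, eff_query * eff_db


# Values reported for the 328-letter query searched against nt.
eff_query, eff_db, space = effective_search_space(
    query_len=328,
    db_letters=2_863_827_885,
    db_sequences=807_597,
    hsp_len=20,
)
print(eff_query, eff_db, space)
```

Recomputing the figures this way is also a convenient check on digits that are hard to read in a printed report.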
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Table 5. Reference animals and the allocated code numbers included in the study

SN.  Code number  Name of the animal   Zoological name
1                 Indian tiger         Panthera tigris tigris
2    bhz26t       Indian tiger         Panthera tigris tigris
3    bhz30t       Indian tiger         Panthera tigris tigris
4    bhz45t       Indian tiger         Panthera tigris tigris
5    bhz56t       Indian tiger         Panthera tigris tigris
6    bhz63t       Indian tiger         Panthera tigris tigris
7    bhz20wt      Indian white tiger   Panthera tigris bengalensis
8    bhz22wt      Indian white tiger   Panthera tigris bengalensis
9    bhz23wt      Indian white tiger   Panthera tigris bengalensis
10   bhz28wt      Indian white tiger   Panthera tigris bengalensis
11   gz1l         Normal leopard       Panthera pardus
12   gz2l         Normal leopard       Panthera pardus
13   gz3l         Normal leopard       Panthera pardus
14   gz21cl       Clouded leopard      Neofelis nebulosa
15   gz22cl       Clouded leopard      Neofelis nebulosa
16   darz14sl     Snow leopard         Panthera uncia
17   darz15sl     Snow leopard         Panthera uncia
18   darz16sl     Snow leopard         Panthera uncia
19   sbz22al      Asiatic lion         Panthera leo persica
20   sbz38al      Asiatic lion         Panthera leo persica
21   sbz39al      Asiatic lion         Panthera leo persica
22   humsk        Human                Homo sapiens sapiens
23   chimss       Chimpanzee           Pan sp.
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sbz22al 

sbz33al 

sbz39ai 

adil. flesh 

gzlni 

gz2nl 

gz3nl 

bhz23wt 

bhz28wt 

bhz22wt: 

bhz20wt 

bhzS6c 
bhs2Sc 
bhz30c 
baz4Sc 
bh=2St: 
d=14sl 
dzlSsl 
dzl5sl 
gz21cl 
gz22cl 
chimss 
humsJc 



TGAATCTCAGGAGCCTTC~ 
TGAATCTGAGCACCCTTCTCAC 
TGAATCTGAGGAGGCTTCTCAG 
TGAATCTGAGGAGGCTTCTCAC 
TGAATCTGAGGAGGCTTCTCAG 
TGAATCTGAGGAGGCTTCTCAG 
TGAATCTGAGGAGGCTTCTCAG 
TGAATCTGAGGAGGCTTCTCAG 
TGAATCTGAGGAGGCTTCTCAG 
TGAATCTGAGGAGGCTTCTCAG' 
TGAATCTGAGGAGGCTTCTCAG 
TGAATCTGAGGAGGCTTCTCAG 
TGAATCTGAGGAGGCTTCTCAG 
TGAATCTGAGGAGGCTTCTCAG 
TGAATCTGAGGAGGCTTCTCAG 
TGAATCTGAGGAGGCTTCTCAG 
TGAATCTGAGGAGGCTTCTCAG 
TGAATCTGAGGAGGCTTCTCAG 
TGAATCTGAGGAGGCTTCTCAG 
TGAATCTGAGGAGGCTTCTCAG 
TGAATCTGAGGAGGCTTCTCAG 
TGAATCTGAGGAGGCTTCTCAG' 
TGAATCTGAGGAGGCTACTCAG 
TGAATCTGAGGAGGCTACTCAG 



TAGACAAACCCACCCTGACACGATTCTT rCCC. *CCAC *0 

STAG ACAAAGCCACCCTGACACG ATT CTTTGCCTTCCAC 6 0 

•AGACAAAGCCACCCTGACACGATTC1 i :CCCw.CCAC 50 

TAGACAAAGCTACCCTGACACGATTCTTTGCC i iCCAC 60 

TAGACAAAGCTACCTTGACACGATTC I * TGCCTTCCAC 6 0 

TAGACAAAGCTACCTTGACACGATTCTTTGCCTTCCAC SO 

ITAGACAAAGCTACCTTGACACGATT CTTT GCCTTCCAC 60 

TAGACAAAGCCACCCTGACACGATTCTTTGCCTTCCAC 60 

.TAGACAAAGCCACCCTGACACGATTCTTTGCCTTCCAC 50 

ITAGACAAAGCCACCCTGACACGATTCTTTGCCTTCCAC 60 

TAGACAAAGC CAC CCTCACACGATT CTTT G C CTTC CAC 60 

TAGACAAAGC CACCCTGACACGATT CTTT GC CTTC CAC 60 

TAGACAAAGCCAC C CTGACACGATTCTTTGCCTTC CAC 60 

JTAGACAAAGCCACCCTGACACGATTCTTTGC C 1 1 CCAC 6 0 

TAGACAAAG C CAC C CTG ACAC G ATTCTTTGC CTT CCAC 60 

fTAGACAAAGCCACCCTGACACGATTCTTTGCCTTCCAC 60 

fTAGACAAAGCCACCCTGACACG ATTCTTTGC CTTC CAC 60 

iTACACAAAGCCACCCTGACACGATTCTTTGCCTTCCAC 60 

fTACACAAAGCCACCCTGACACGATTCTTTGCCTTCCAC 60 

TACACAAAGCCACCCTGACACGATTCTTTGCCTTCCAC 60 

TAGACAAAGCCACCCTGACACG ATTTTT CGCCTTCCAC 60 

TAGACAAAGC CACC CTGACACG ATTTTTCGCCTTCCAC 60 

JAGACAGC CCTACCCTTACACGATT CTTC AC CTTCCAC 60 

TAGACAGTCC-CACCCTCACACGATTCTTTACCTTTCAC 60 



sbz22al 

sbz33al 

sbz39al 

adil - flesh 

gzlnl 

gz2nl 

S z3nl 

bfl=23wc 
bhz2 3wt 
bhz22wt: 
bhz2 0vc 
bhz63c 
bhzS6~ 
bhz2S:: 
bhz3 0c 
bhz45c 

dzl43l 

dzlSsl 
dzlGal 
gc21cl 
gr 22c L 
chime a 



TTCATCCTTCCATTTATCATCTCAGCCCTAGCAGCAGTCCACCTCCT 
TTCATCCTTCCATTTATCATCTCAGCCCTAGCAGCAGTCCACCTC^^ 

TTCATCCTTCCATTTATCATCTCAGCCCTAGCAGCAGTCCACCT>- C j. ^^^^^^r^lpV^ 
TTCATCCTTCCATTTATCATCTCAGCTCTAGCAGCAGTCCACCTC «- i ^^^ v- 2 < ^~^^^^ 
TTCATCCTTCCATTTATCATCTCAGCTCTAGCAGCAGTCCACCTC ^^^^^^^ ^AG 
TTCATCCTTCCATTTATCATCTCAGCTCTAGCACCAGTCCACC^ 
TTCATCCTTCCATTTATCATCTCAGCTCTAGCAGCAGTCCACCTCCTA^^ 

TTCATCCTTCCATTTATCATC^ 

TTCATCCTTCCATTTATCATCTCAGCCCTAGCAGCAGTCCACCTCC^A^ 

TTCATCCTTCCATTTATCATCTCAGCCCTAGCAGCAGTCCACCTCCTA^^^ 

TTCATCCTTCCATTTATCATCTCAGCCCTAGCAGCAGTCCACCTCCTATT^CTC 

TTCATCCTTCCArTTATCATCT^ 

TTCATCCTTCCATTTATCATCTCAGCCCTAGCAGCAGTCCACCTC w ^ ~^^^V«-r^r 
TTCATCCTTCCATTTATCATCTCAGCCCTAGCAGCAGTCCACCTCw l A — 
TTCATCCTTCCATTTATCATCTCACCCCTAGCAGCAGTCCACCTCC^ 

TTC^TCCnTCCATTTATCATCT^ 
TTCATCCTTCCATTTATCATCTCAGCCCrAC^ 

TTCATCCTTCCATTTATCATCTCAGCCCTAGCAGCAGTCCACCTC^ - ^^^^X^^r^'T-r AG 

TTCATCCTTCCATTTATCATCTCACCC^^ 

TTCATCCTTCCATTTATCATC^^ 

TTCATCCTCCCATTTATCATCTCAGCCTTAGCAGCAG— ^ Ct ^^I 1^^-^-AA 
TTCATCCTCCCArrrATCATCTC\GCCTTAGCAGCAGTTCACC ! I - '^^^^^^ 
TTTATCTTACCCTTCATTATCACAGCCCTAACAACAC— ^ -—^^Xll^^AA 
TTCATCTTGCCCTTCATTATTGCAGCCCTAGCAGCACTCCACCTC ~ * a ^ 



120 
120 
120 
120 
120 
120 
120 
120 
120 
120 
120 
120 
120 
12 0 
120 
120 
120 
120 
120 
120 
120 
120 
120 
120 



ab:34al 



ACACCATCTAATAACCCCTCAv^CAATCGTATC 

acaccatctaataaccccttag^aatcctatc 



tcact-^cataaaatt' 



TCCATCCA 13 0 
-TTCATCCA 1^0 
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6b =3 9a 1 ACACCATCTAATAACCCCTCAGCAATGGTATCTCACrCACATAAAATTCCArrCCA.CCA A 3 0 

adil • flesh ACAGGATCTAACAACCCCTCAGGAA-TAGTATCCGACTCAGACAAAATTCCA. -CCAC.^ x30 

g . lal ACAGGATCTAACAACCCCTCAGGAATAGTATCCGACTCAGACAAAATTCCATTCv.ACv.y-A .3 0 

c - 2 n 1 ACAGGATCTAACAACCCCTCAGGAATAGTATCTGACTCAGACAAAATTCCA . . CCACCCA 1 3 j 

g - 3Rl ACAGGATCTAAaVACCCCTCAGGAATAGTATCTGACTCAGACAAAATTCCA^CCACCCA 13 0 

bh=23 w: ACAGGATCTAACAACCCCTCAGGAATAGTATCTGACTCAGACAAAATCCCGj^t CCACCCA 13 0 

bh-^ 3wc ACAGGATCTAACAACCCCTCAGGAATAGTATCTGACTCAGACAAAATCCCGj^t CCACCCA 130 

bh=22wc ACAGGATCTAACAACCCCTCAGGAATACTATCTGACTCAGACAAAATCCCG * i CCACCCA 130 

bh'-'Owc ACAGGATCTAACAACCCCCCAGGAATAGTATCTGACrCAGACAAAATCCCGTTCCACC'CA 130 

b ---,S3- ACAGGATCTAACAACCCCTC^GGAATAGTATCTGACTCAGACAAAATCCCGTTCCACCCA 13 0 

bhzSSt ACAGGATCTAACAACCCCTCAGGAATAGTATCTGACTCAGACAAAATCCCGTTCCACCCA 13 0 

b b--»SC ACAGGATCTAACAACCCCTCACGAATAGTATCTGACTCAGACAAAATCCCCTTCCACCCA 180 

bh~3 0C ACAGGATCTAACAACCCCTC^GGAATAGTATCTGACTCAGACAAAATCCCGTTCCACCCA 13 0 

bh ~4SC ACAGGATCTAACAACCCCTCAGGAATAGTATCTGACTCAGACAAAATCCCGTTCCACCCA. 13 0 

b -'Z -> s - ACAGGATCTAAC2AACCCCTCAGGAATAGTATCTGACTCAGACAAAATCCCGTTCCACCCA 1 3 0 

d-Hsl ACAGGATCTAACAACCCCTCAGGAATAGTATCTGACTCAGACAAAATCCCGTTCCACCCA 13 0 

d=lS S 1 ACAGGATCTAACAACCCCTCAGGAATAGTATCTGACTCAGACAAAATCCCGTTCCACCCA 1 3 0 

d _ 15sl AC2AGGATCTAAO^ACCCCTCAGGAATAGTATCTGACTCAGACAAAATCCCGTTC^C_^ 130 

C-21C1 A CAC^ATCCAATAACCCCTC\GGAATGGTATCCGATTCAGACAAAATCCCGTTC^C_^ --0 

c=2 2 c i ACAGGATCCAATAACCCCTCAGGAATGGTATCCGATTCAGACAAAATCCCGTT^~Aww-vi - 3 0 

c h « ms S ACAGGATCAAATAACCCCCTGGGAATCACCTCCCACTCCGACAAAATTACCTTCCACww- - 3 0 

bumsk ACGGGATCAAACAACCCCCTAGGAATCACCTCCCATTCCGATAAAATCATCTTCCAC^-i 130 

* * » W * * * * * * ****** ***** * * * ** * ♦ ***** ***** *« 

sb=22al TACTATACAATCAAAGATACCCTAGGCCTTCTAGTACTAATCTTAAOACTCATACTACTC 240 

sbz33al XACTATACAATCAAAGATATCCTAGGCCTTCTAGTACrAATCTTAACACT -40 

sb~ 3 9al TACTATACAATCAAAGATArCCTAGGCCTTCTAGTACTAATCTTAACACTCA.AC^AC.w 2,0 

adil . flesh TACTACACAATCAAAGATATCCTC<WCCTTCTAGTACTAATCCTAGCACT^ - 0 

^ 1 TACTACACAATCAAAGATATCCTGGGCCTTCTAGTACTAATCCTAGCAC rCA^CTACTC 2 4 0 

ZZ^l T ACTAC\CAATCAAAGA(2ATCCTGGGCCTTCTAGTACTAATCTTAGC2ACTCATACT^CT^ 240 

==3=1 XACTACAC2AATCAAAGACATCCTGGGCCTTCTAGTACTAATCTTAGCACTCATACT 240 

bb=23w= XACXACACZAAXCAAAGACACCCTGOSCCTX^ ;« 

bh223wc T ACTACACAATCAAA/»»TCCTGGGCCTTC^^ -*« 

bh=22wc XACTAC2ACAATCAAAGAC^TCCTGC^CCTTCTAGTACX.AATCCTAACACT^-^-C^ : -40 

bh=2Cwc XACXACAC^XCAAAGA<2ATCCTGGGCcrrTCTAGXACTAATCCXAACA^ -JO 

bh2S3 - XACTACACAATCAAAGACATCCTGGGCCTTCTAGTACTAATCCTAACACTCA^AC^ACXw .4 0 

bb - 55 - XACTACACAATCAAAGACACCCTGGGCCTTCTAGTACTAATCCTAACACTCA . «CTnw - - 



b h= 2 2 w: XACXACACAAXCAAAGACATCCTGGGCCXXCTAGXACrAATCCTAACAC . -"^^ C _2- 

bh=2Cwc XACXACAC^XCAAAGA<2ATCCTGGGCcrrTCTAGXACTAATCCXAACA^ -JO 
XACTACACAATCAAAGACATCCTGGGCCTTCTAGTACTAATCCTAACACTCAi.AC^ACXw .,0 

TACTACACAAXCAAAGACATCCTGGGCCTTCXAGTACTAATCCTAACACTCA - ~ , , 

bh=2Sc XACTACACAAXCAAAGACAXCCTGGGCCTTCTAGTACXAATCCXAACACr -40 

b hs 3 0 c XACXACACAAXCAAAGACATCCTGGGCCTTCTAGTACTAATCCTAACACTCA^ACj^ACT^ - •* u 

bh=4 5= XACTACACAAXCAAAGACATCCXGGGCCXTCXAGTACTrAATCCTAACACTCA^ACTACj.^ -4 0 

bh=2Sc TACTACACAATCAAAGACA-CCTGGGCCTT CTAGTAC-TAATCCTAACACTCA . ACT ACT- 2 40 

d = 3 4 s 1 TACTACACAATCAAAGACATCCTGCGCCTTCTAGTACTAArCCTAACACTCA t ACTACT- . 

d= i S S 1 TACTACACAATCAAAGACATCCTGGGCCTTCTAGTACTAATCCTAACAC . CA.AC_.AC-. - 4 u 

ds 1« » 1 TACTACACAATCAAAGACATCCTGGGCCTTCTAGTACTAATCCTAACACTCA . ^^CT- - 

g= 2 1 c 1 TACXAXA<2AAXCAAAGATATCCTAGGCCrCCTAGTXCTAArrCTAGCGCT.^^u^C^ - « 

g-2 2cl TACTATACAATCAAACATATCCTACCCCTCCTAGTTCTAATT CTAGCCCT^^^----- - J 

chims 3 TACrA<2A<^XCAAAGATA-CCTTGGCTTATTCCrrTTTCCTCCTTAT ; 0 

fumsk TACTACACAATCAAAGACGCCCTCGGCTTACTTCTCTTCCTTCTCT ~wA_A..A J 



s5 = 2 2a 1 GTCCrATTCTCACCAGACCTATTAGCAGAXCCCGACAAtrTATACCCCCGs.-^. - -_-A 

sb = J3.iL GTCCTATTCrCACCAGACCTATTAGCAGATCCCGACAACTATACGGCCGC^^-^-- ^ 

Si3 = 3 9a 1 GTCCTATTCTCACCAGACCTATTAGGAGATCCCGACAAv. . ATACCCCCGC_^A . , - . - 



adil - flesh CTCCTATTCrCACCACACCTGTTAGGAGACCCCGATAACTACATCv.-.GC^CC^^ 

g-lal GTCCTATTCTCACCACACCTGTTAGGAGACCCCGATAACTACATCCCTGC^C^^-.^ 1 

g- 2n l CTCCTATTCTCACCAGACCTGTTGG-AGACCCCGATAACT ACATCCCCSC-AACww.^ J 

a - 3 a 1 GTCCTATTCTCACCAGACCTG-rTGCGAGACCCCGArAACTACATCCCCGCCAAC _ A i 

toh - ; 3 w- CTCCTATTCTCACCAGACCTATTAGGGCACCCCG ATAA>— ACATTCCCGCCAAC A 
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bhz2 3 wt GTCCTATTCTCACCAGACCTArrACGCGACCCCGATAACTACATCCCCGCCAACCCTCTA 3 0 0 

bhz22wt GTCCTATTCTCACCAGACCTATTAGGGGACCCCCATAACTACATCCCCGCCAACCCTCTA 3 00 

bhz2 0 wC GTCCTATTCTCACCAGACCTATT AGGCGACCCCGATAACTACATCCCCGCCAACCCTCTA 3 0 0 

btl z63 1 GTCCTATTCTCACCAGACCTATT AGGGGACCCCG ATAACTACATCCCCGCCAACCCTCTA 3 CO 

bh I 5 6 <- GTCCTATTCTCACCAGACCTATT AGGGGACCCCG ATAACTACATCCCCGCCAACCCTCTA 3 0 0 

bh I 2 6 Z GTCCTATTCTCACCAGACCTATTACGGGACCCCGATAACTACATCCCCGCCAACCCTCTA 3 0 0 

b hz3 0t GTCCTATTCTCACCAGACCTATTAGGGGACCCCGATAACTACATCCCCGCCAACCCTCTA 3 00 

bh 2 4 s c GTCCTATTCTCACCAGACCTATT AGGGGACCCCG ATAACTACATCCCCGCCAACCCTCTA 3 0 0 

bh ^2 S C GTCCTATTCTC^CCAGACCTATTAGGGGACCCCGATAACTACATCCCCGCCAACCCTCTA 3 00 

dz 14 s 1 GTCCTATTCTCACCAG ACCTATTAGGGG ACGCCGATAACTACATC CC CGCCAACCCTCTA 3 0 0 

d-lSsl GTCCTATTCTCACCAGACCTATTAGGGGACGCCGATAACTACATCCCCGCCAACCCTCTA 3 0 0 

d~ 16 s 1 GTCCTATTCTCACCAGACCTATTAGGGGACGCCGATAACTACATCCCCGCCAACCCTCTA 3 0 0 

g~->lcl GTTCTATTCTCCCCAGACCTACTAGGAGACCCTGACAATTACACTCCCGCCAACCCTCT 3 00 

g- 2 2 cl GTTCTATTCTCCC CAGAC CTACTAGGAGACCCTGACAATTACACT CCCGC CAACCCTCTA 3 0 0 

chims s ACACTATTCTCACCAGACCTCCTGGGCGATCCAGACAACTATACCCTAGCTAACCCCCTA 3 0 0 

hums* ACACTATTCTCAC CAG ACCTC CTAGG CG AC C CAG ACAATTATAC C CTAG C CAAC CC CTTA 3 00 



sbz22al 

sbz33al 

sbz39al 

adil. flesh 

gzlnl 

gz2nl 

gz3nl 

bhz23wt 

bhz2 3wt: 

bhz22w<c 

bhz2 0wt: 

bhzS3C 

bhzSSt 

bhz26c 

bhz30c 

bhz4Sc 

bhz2SC 

dz!4sl 

dzlSsl 

izlSsl 

gz21cl 

j=22cl 

^hitnss 

iums5c 



AGCACCCCTCCCCATATCAAACCTGAAT 328
AGCACCCCTCCCCATATCAAACCTGAAT 328
AGCACCCCTCCCCATATCAAACCTGAAT 328
AATACCCCTCCCCATATCAAGCCTGAAT 328
AATACCCCTCCCCATATCAAGCCTGAAT 328
AATACCCCTCCCCATATCAAGCCTGAAT 328
AATACCCCTCCCCATATCAAGCCTGAAT 328
AACACCCCTCCCCATATCAAGCGCGAAT 328
AACACCCCTCCCCATATCAAGCGCGAAT 328
AACACCCCTCCCCATATCAAGCGCGAAT 328
AACACCCCTCCCCATATCAAGCGCGAAT 328
AACACCCCTCCCCATATCAAGCGCGAAT 328
AACACCCCTCCCCATATCAAGCGCGAAT 328
AACACCCCTCCCCATATCAAGCGCGAAT 328
AACACCCCTCCCCATATCAAGCGCGAAT 328
AACACCCCTCCCCATATCAAGCGCGAAT 328
AACACCCCTCCCCATATCAAGCGCGAAT 328
AACACCCCTCCCCATATCAACCCCGAAT 328
AACACCCCTCCCCATATCAAGCCCCAAT 328
AACACCCCTCCCCATATCAAGCGCGAAT 328
AATACCCCTCCCCATATCAAGCCTGAAT 328
AATACCCCTCCCCATATCAAGCCTGAAT 328
AACACCCCACCCCACATTAAACCCGAAT 328
AACACCCCTCCCCACATCAAGCCCGAAT 328
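The comparison that an alignment block like the one above supports can be sketched in a few lines. The example below takes four rows of the final 28-base block and reports which columns vary and how many mismatches separate the sample labelled adil.flesh from each reference. It is an illustration under stated assumptions, not the procedure used in the study: the pairing of rows with sample codes is assumed from the order of the label list that precedes the block, and the helper names `variable_sites` and `hamming` are hypothetical.

```python
# Four rows copied from the last alignment block (28 aligned bases each).
# The row-to-label pairing is an assumption based on the label order.
block = {
    "sbz22al": "AGCACCCCTCCCCATATCAAACCTGAAT",
    "adil.flesh": "AATACCCCTCCCCATATCAAGCCTGAAT",
    "gz1nl": "AATACCCCTCCCCATATCAAGCCTGAAT",
    "bhz23wt": "AACACCCCTCCCCATATCAAGCGCGAAT",
}


def variable_sites(sequences):
    """Return 0-based column indices at which the aligned sequences differ."""
    columns = zip(*sequences)
    return [i for i, column in enumerate(columns) if len(set(column)) > 1]


def hamming(a, b):
    """Count mismatching positions between two equal-length aligned sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(x != y for x, y in zip(a, b))


sites = variable_sites(block.values())
distances = {
    code: hamming(block["adil.flesh"], seq)
    for code, seq in block.items()
    if code != "adil.flesh"
}
closest = min(distances, key=distances.get)
print("variable columns in block:", sites)
print("mismatches to adil.flesh:", distances)
print("closest reference in this block:", closest)
```

A single 28-base block is far too short to assign a species on its own; the same counting would be applied over the full aligned fragment.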
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Table 7d 



Position |273j276|279|282|284|285|2B7(28a|29t 294|297 298 302 



add. flesh 1 C TjCiC T • C ; C T ; C 
gzU 




C ■ T . c : A 



bhz2St . C ' : 

bhz26t i ... i c ! 

bhz30t' : .* r : c r 

bhz45t . . ' ; ■ . ' . . r c j 

bhz56t • C [ 

bhz20wt ... i f c r 

bhz22wt > . ! c 

bhz23wt ^ . . j . . : . ! . ; q ! 

sbz22al [ . C 

sbz38al r . * C : . IT C I 

gz21cl " m '2l T ' C ' T ! . : C : T~ '." r C"l 

gz22cf~~TT C." T cTtTV Tc": 



323 324 



T T T : C . G C T 
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Table 10

BLASTN 2.1.2 [Nov-13-2000]

Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

RID: S8459169S-1007S-1360S

Query=

(25 letters)

Database: nt

807,597 sequences; 2,863,827,885 total letters

If you have any problems or questions with the results of this search
please refer to the BLAST FAQs

Taxonomy reports

Distribution of 500 Blast Hits on the Query Sequence



[Mouse-over to show defline and scores. Click to show alignments]
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Sequences producing significant alignments: 

gb|AF231651.1|AF231651 Strongylura notata clone HB-82 cytoc...
gb|AF231650.1|AF231650 Strongylura notata clone HB-159 cyto...
ref|NC_002672.1| Dinornis giganteus mitochondrion, complete...
ref|NC_002673.1| Emeus crassus mitochondrion, complete genome
gb|AF232015.1|AF232015 Nothrotheriops shastensis cytochrome...
gb|AF232013.1|AF232013 Bradypus variegatus cytochrome b gen...
gb|AY016015.1| Emeus crassus mitochondrion, complete genome
gb|AY016013.1| Dinornis giganteus mitochondrion, complete g...
gb|AY016014.1| Dromaius novaehollandiae mitochondrion, part...
gb|AF230167.1|AF230167 Bonasa umbellus cytochrome b (CYTB)...
gb|AF074594.1|AF074594 Baeolophus bicolor cytochrome b gene...
gb|AY005210.1| Poospiza melanoleuca isolate 3 cytochrome b...

Poospiza melanoleuca isolate 2 cytochrome b...

Poospiza melanoleuca isolate 1 cytochrome b
Poospiza hispaniolensis cytochrome b (cytb)...
Poospiza garleppi cytochrome b (cytb) gene,...
Poospiza erythrophrys cytochrome b (cytb) ge...
Poospiza boliviana cytochrome b (cytb) gene,...
Poospiza alticola isolate 2 cytochrome b (cy...

Poospiza alticola isolate 1 cytochrome b (cy...

gb|AF155870.1|AF155870 Heterocephalus glaber cytochrome b (

gb|AF189123.1|AF189123 Glyptotermes eukalypti cytochrome b...
gb|AF102099.1|AF102099 Criniferoides leucogaster cytochrome...

gb|AF102095.1|AF102095S1 Corythaixoides concolor cytochrome

gb|AF271065.1|AF271065 Mustela erminea specimen-voucher AF1

gb|AF243857.1|AF243857 Strongylura notata notata cytochrome...
gb|AF243856.1|AF243856 Strongylura notata forsythia cytochr...
ref|NC_001567.1| Bos taurus mitochondrion, complete genome
gb|AF306872.1|AF306872 Brachyramphus marmoratus haplotype M...

Brachyramphus marmoratus haplotype M...



gb|AY005209.1|
gb|AY005208.1|
gb|AY005205.1|
gb|AY005204.1|
gb|AY005203.1|
gb|AY005201.1|
gb|AY005199.1|
gb|AY005198.1|



gb|AF306871.1|AF306871
gb|AF306870.1|AF306870
gb|AF306869.1|AF306869
gb|AF306868.1|AF306868
gb|AF010406.1|AF010406
gb|AF248662.1|AF248662
gb|AF248661.1|AF248661
gb|AF096462.1|AF096462
gb|AF283644.1|AF283644
gb|AF283643.1|AF283643
gb|AF283642.1|AF283642
gb|AF283641.1|AF283641
gb|AF283640.1|AF283640
gb|AF283639.1|AF283639
gb|AF283637.1|AF283637
gb|AF283636.1|AF283636
gb|AF283635.1|AF283635
gb|AF283634.1|AF283634
gb|AF283633.1|AF283633
gb|AF283632.1|AF283632
gb|AF283631.1|AF283631
gb|AF283630.1|AF283630
gb|AF283629.1|AF283629
gb|AF283628.1|AF283628
gb|AF283627.1|AF283627
gb|AF283626.1|AF283626
gb|AF283625.1|AF283625
gb|AF283624.1|AF283624
gb|AF283623.1|AF283623
gb|AF283622.1|AF283622
gb|AF283621.1|AF283621
gb|AF283620.1|AF283620

gblAf^gj^^.tl/Kf^nil? 
gb|Ar^93^19,;|AF^g3^19 



c. . 
c . . 
c. . 
c. . 
c. . 
c. . . 



Brachyramphus brevirostris haplotype 
Brachyramphus brevirostris haplotype 
Brachyramphus brevirostris haplotype 
Ovis aries complete mitochondrial ge 
Gryllus campestris haplotype 2 cytoc
Gryllus campestris haplotype 1 cytoc
Rhipidura albicollis cytochrome b ge
Elaphe obsoleta cytochrome b gene
Elaphe obsoleta cytochrome b gene
Elaphe obsoleta cytochrome b gene
Elaphe obsoleta cytochrome b gene
Elaphe obsoleta cytochrome b gene
Elaphe obsoleta cytochrome b gene
Elaphe obsoleta LSUMZ 45359 cytochro..
Elaphe obsoleta LSUMZ 44662 cytochro..
Elaphe obsoleta LSUMZ 40443 cytochro..
Elaphe obsoleta LSUMZ 44335 cytochro..
Elaphe obsoleta LSUMZ 42624 cytochro..
Elaphe obsoleta LSUMZ H1911 cytochro..
Elaphe obsoleta LSUMZ 41197 cytochro..
Elaphe obsoleta LSUMZ 41189 cytochro..
Elaphe obsoleta LSUMZ 41188 cytochro..
Elaphe obsoleta LSUMZ 41187 cytochro..
Elaphe obsoleta LSUMZ 41186 cytochro..
Elaphe obsoleta LSUMZ 40943 cytochro..
Elaphe obsoleta LSUMZ 37499 cytochro..
Elaphe obsoleta LSUMZ 44490 cytochro..
Elaphe obsoleta LSUMZ 44451 cytochro..
Elaphe obsoleta LSUMZ 40444 cytochro..
Elaphe obsoleta LSUMZ 39925 cytochro..
Elaphe obsoleta LSUMZ 37163 cytochro..
Elaphe obsoleta LSUMZ 39162 cytochrom..
Elaphe obsoleta LSUMZ K1S996 cytochr..
Elaphe obsoleta LSUMZ H15892 cytochr..
Elaphe obsoleta LSUMZ 15891 cytochro..
Elaphe obsoleta LSUMZ H15470 cytochr..
Elaphe obsoleta LSUMZ K15S99 cytochr..
Elaphe obsoleta LSUMZ H;sa<t<t cytochr..
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Table 9. Animals selected for validation of minimum P,S score for efficient amplification
of DNA templates in PCR

SL.  Name                                         P,S/AFF  P,S/AFR
1    Indian black buck (Antilope cervicapra)      gj t 53  S6 t 54
2    Sheep (Ovis aries)                           87.53    96.54
3    Pig (Sus scrofa)                             87.52    87.41
4    Fresh water dolphin (Platanista gangetica)   86.49    82.47
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gb|AY005209.1|
gb|AY005208.1|
gb|AY005205.1|
gb|AY005204.1|
gb|AY005203.1|
gb|AY005201.1|
gb|AY005199.1|
gb|AY005198.1|



Sequences producing significant alignments:

gb|AF231651.1|AF231651 Strongylura notata clone HB-82 cytoc...
gb|AF231650.1|AF231650 Strongylura notata clone HB-159 cyto...
ref|NC_002672.1| Dinornis giganteus mitochondrion, complete...
ref|NC_002673.1| Emeus crassus mitochondrion, complete genome
gb|AF232015.1|AF232015 Nothrotheriops shastensis cytochrome...
gb|AF232013.1|AF232013 Bradypus variegatus cytochrome b gen...
gb|AY016015.1| Emeus crassus mitochondrion, complete genome
gb|AY016013.1| Dinornis giganteus mitochondrion, complete g...
gb|AY016014.1| Dromaius novaehollandiae mitochondrion, part...
gb|AF230167.1|AF230167 Bonasa umbellus cytochrome b (CYTB)...
gb|AF074594.1|AF074594 Baeolophus bicolor cytochrome b gene...
gb|AY005210.1| Poospiza melanoleuca isolate 3 cytochrome b...

Poospiza melanoleuca isolate 2 cytochrome b...
Poospiza melanoleuca isolate 1 cytochrome b
Poospiza hispaniolensis cytochrome b (cytb)...
Poospiza garleppi cytochrome b (cytb) gene,...
Poospiza erythrophrys cytochrome b (cytb) ge...
Poospiza boliviana cytochrome b (cytb) gene,...
Poospiza alticola isolate 2 cytochrome b (cy...
Poospiza alticola isolate 1 cytochrome b (cy...

gb|AF155870.1|AF155870 Heterocephalus glaber cytochrome b (...

gb|AF189123.1|AF189123 Glyptotermes eukalypti cytochrome b

gb|AF102099.1|AF102099 Criniferoides leucogaster cytochrome

gb|AF102095.1|AF102095S1 Corythaixoides concolor cytochrome...
gb|AF271065.1|AF271065 Mustela erminea specimen-voucher AF1...
gb|AF243857.1|AF243857 Strongylura notata notata cytochrome...

gb|AF243856.1|AF243856 Strongylura notata forsythia cytochr

ref|NC_001567.1| Bos taurus mitochondrion, complete genome

gb|AF306872.1|AF306872 Brachyramphus marmoratus haplotype M

Brachyramphus marmoratus haplotype M...
Brachyramphus brevirostris haplotype...
Brachyramphus brevirostris haplotype...
Brachyramphus brevirostris haplotype...
Ovis aries complete mitochondrial ge...
Gryllus campestris haplotype 2 cytoc...
Gryllus campestris haplotype 1 cytoc...
Rhipidura albicollis cytochrome b ge...
Elaphe obsoleta cytochrome b gene,
Elaphe obsoleta cytochrome b gene,
Elaphe obsoleta cytochrome b gene,
Elaphe obsoleta cytochrome b gene,
Elaphe obsoleta cytochrome b gene,
Elaphe obsoleta cytochrome b gene,
Elaphe obsoleta LSUMZ 45359 cytochro...
Elaphe obsoleta LSUMZ 44662 cytochro...
Elaphe obsoleta LSUMZ 40443 cytochro...
Elaphe obsoleta LSUMZ 44335 cytochro...
Elaphe obsoleta LSUMZ 42624 cytochro...
Elaphe obsoleta LSUMZ H19LL cytochro.



gb|AF306871.1|AF306871
gb|AF306870.1|AF306870
gb|AF306869.1|AF306869
gb|AF306868.1|AF306868
gb|AF010406.1|AF010406
gb|AF248662.1|AF248662
gb|AF248661.1|AF248661
gb|AF096462.1|AF096462
gb|AF283644.1|AF283644
gb|AF283643.1|AF283643
gb|AF283642.1|AF283642
gb|AF283641.1|AF283641
gb|AF283640.1|AF283640
gb|AF283639.1|AF283639
gb|AF283637.1|AF283637
gb|AF283636.1|AF283636
gb|AF283635.1|AF283635
gb|AF283634.1|AF283634
gb|AF283633.1|AF283633
gb|AF283632.1|AF283632
gb|AF283631.1|AF283631
gb|AF283630.1|AF283630
gb|AF283629.1|AF283629
gb|AF283628.1|AF283628
gb|AF283627.1|AF283627
gb|AF283626.1|AF283626
gb|AF283625.1|AF283625
gb|AF283624.1|AF283624
gb|AF283623.1|AF283623
gb|AF283622.1|AF283622
gb|AF283621.1|AF283621
gb|AF283620.1|AF283620
gb|AF283619.1|AF283619
gb|AF283618.1|AF283618
gb|AF283617.1|AF283617
gb|AF283616.1|AF283616
gb|AF283615.1|AF283615
gb|AF283614.1|AF283614



c . 
c. 
c . 
c . 
c. 
c. 



Elaphe obsoleta LSUMZ 41197 cytochro... 50 
Elaphe obsoleta LSUMZ 41189 cytochro... 
Elaphe obooleta LSUMZ 41138 cytochro... 
Elaphe obsoleta LSUMZ 41107 cytochro... 
Elaphe obaoleca LSUMZ 41186 cytochro... 
Elaphe obaoleta LSUMZ 40943 cytochro. . . 
Elaphe obaoleta LSUMZ 37499 cytochro... 
Elaphe obsolete LSUMZ 44480 cytochro... 
Elaphe obaoleta LSUMZ 44451 cytochro... 
Elaphe obaoleta LSUMZ 40444 cytochro... 
Elaphe obooleta LSUMZ 3 9925 cytochro... 
Claphe obaoleta LSUMZ 39163 cytochro. . . 
Elaphe obaoleta L5UMZ3 9162 cytochrom. . . 
Claphe obaoleta LSUMZ H15896 cytochr. . . 
Elaphe obaoleta LSUMZ K*wSai2 cytochr. . . 
Elaphe obaoleta LSUMZ 15991 cytochro. . . 
Elaphe obaoleta LSUMZ HIS990 cytochr. . . 
Elaphe obaol-ta LSUMZ H1349? cytochr. . . 
Claphe obaoleca LSUMZ HISaafl cytochr. . . 
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Elaphe obsoleta LSUMI rUS884 cytocnr 
Elaphe obsoleta LSUM2 H1S031 cytochr 
Elaphe obsoleta LSUM2 H15030 cytochr 
Elaphe obsoleta CAS 16 94 68 cytochrom 
Elaphe obsoleta LSUM2 H14782 cytochr 
Elaphe obsoleta LSUMZ H14781 cytochr 
Elaphe obsoleta LSUMZ H1472* cytochr 
Elaphe obsoleta cytochrome b gene, c 
Elaphe obsoleta cytochrome b gene, c 
Elaphe obsoleta cytochrome b gene, c

Database: nt
Posted date: Mar 2, 2001 12:20 AM
Number of letters in database: 2,863,827,885
Number of sequences in database: 807,597

Lambda K H
1.37 0.711 1.31

Gapped
Lambda K H
1.37 0.711 1.31

Matrix: blastn matrix:1 -3
Gap Penalties: Existence: 5, Extension: 2
Number of Hits to DB: 35355
Number of Sequences: 807597
Number of extensions: 39355
Number of successful extensions: 15066
Number of sequences better than 10.0: 5706
length of query: 25
length of database: 2,863,827,885
effective HSP length: 17
effective length of query: 8
effective length of database: 2,850,098,736
effective search space: 22800789888
effective search space used: 22800789888
T: 0
A: 30
X1: 6 (11.9 bits)
X2: 15 (29.7 bits)
S1: 12 (24.3 bits)
S2: 16 (32.2 bits)

Table 11. BLAST analysis of primer 'mcb869' in the nr database of NCBI. It
demonstrates that the 3' end of this primer is highly conserved among a vast
range of animal species. It also shows significant homology between the
primer and the templates (i.e. the cytochrome b gene fragments of different
animal species), confirming the universal nature of our primer.
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;ED : 

BLASTN 2.1.2 [Nov-13-2000]

Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

RID: 984593033-24247-14777

Query=
(25 letters)

Database: nt
807,597 sequences; 2,863,827,885 total letters

Distribution of 500 BLAST hits on the query sequence
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Sequencer producing significant alignments: 

QbtAF139111.llAF189111 Cryptotecnes auatrinua cytochrome b . 
gblSagfllj . llaasan Phyllocia wolfJaohni MS 3 67270 cynochrcm. 
abf AF123S33 . if AF123S33 Periaaocephalus tricolor cytochrome . 
obi AF123S17 . ll AF123S17 Pipreola arcuaca cytochrome b gene, . 
cb I AF1272Q2 . 1 1 AF127202 Hylopezua fulviventri3 cytochrome b . 
qbf AF12 7194 . 1 1AF127194 Crallaria guacimalensia cytochrome b. 
qbi AF2 17828 .11 AF217828 Aapideiaps acutacus cytochrome b gen. 
qbl AFlS0S7a .1 1 AF1S0579 Hypcgeomys antimena HantSSS cytochro . 
gb | AFQ0 9931 . 2 1 AF00993 1 Archocentrus centrarchua cytochrome . 
ob| AF091S29 . if AF091S29 Ancilocapra americana cytochrome b ( . 
qbl AF0349S7.il Sigmoceroa lichtenatsinii cytochrome b gene, . 
gb | AF038290 . 1 1 AF038290 Antechinus sp. cytochrome b gene, mi. 
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qb| AF0a4Q74 .llAF084074 
ab I AFQ907S0 a 1 1 AF090750 
qb|AF15793 9,llAF157939 
? b|AFlS7937,llAFlS7937 
qb|AFlS7936.l)AF15793<5 
< ? b |AF157915.llAFlS79IS 
cb|AFl57914.llAF157914 
c ? b |A?lS7912.llAFlS79t2 
Qb I AFI 57906 . 1 1 A? 157906 
Qb|AF157a91. 1IAFX57891 
QblAFl57fla2 .HAFl57aa2 
qb | AF157BS9 . 1 I AF157359 
qbf AFlS7fl5a .llAF15785a 
qb|AFl57339.llAF157a39 
ab 1 AF030497 . 1 1 AF030497 



Lagenorhynchus albirostris cytochrom
Gobio gobio balcanicus cytochrome b
Spermophilus columbianus columbianus
Spermophilus washingtoni isolate S39
Spermophilus washingtoni isolate S38
Spermophilus richardsonii isolate S63
Spermophilus richardsonii isolate S62
Spermophilus undulatus isolate S50 c
Spermophilus undulatus isolate S55 c
Spermophilus elegans elegans isolate
Spermophilus columbianus columbianus
Spermophilus citellus isolate S113 c
Spermophilus citellus isolate S117 c
Spermophilus elegans elegans isolate
Crocidura brunnea cytochrome b (cyt
gb|U03541.2|LAU03541 Lenoxus apicalis cytochrome b gene, pa
gb|AF009951.2|AF009951 Heros appendiculatus cytochrome b (c

Tomocichla tuba cytochrome b (cytb)
Archocentrus sajica cytochrome b (cy
Stachyris whiteheadi cytochrome b ge
Eminia lepida cytochrome b gene, par
Hypergerus atriceps cytochrome b gen
Phascolarctos cinereus cytochrome b
Geomys bursarius ozarkensis cytochro
Geomys bursarius majusculus cytochro
Geomys bursarius bursarius cytochrom
Geomys bursarius missouriensis cytoc
Spermophilus citellus cytochrome b (
Bubalus depressicornis cytochrome b
Dromiciops gliroides cytochrome b ge
Vombatus ursinus cytochrome b gene,
Tragelaphus eurycerus cytochrome b (cytb) ge
Kobus ellipsiprymnus cytochrome b (cytb) gen
Antilope cervicapra cytochrome b (cytb) gene
Tragelaphus oryx cytochrome b (cytb) gene, m
Antidorcas marsupialis cytochrome b (cytb) g
Connochaetes gnou cytochrome b (cytb



qb| AF009941 . 1 IAF009941 
ob| AF00992S .11 Ar 009925 
cb> AF094gl3 .llAF094g33 
cb ! AF094S21 . 1 1 AF094S21 
cbl AF094gia . 1IAF094S18 
qb 1 AFlSS^a . 1 1 AF1S6343 
qb I AF158697 . 1 I AF1S8697 
qblAFlS3g94.l)AF158594 
qb |AF15flg93 . lfArl58g93 
qb \ AFl53gaa . 1 1 AF1S3S38 
qb | AF10O720 . 1 I AF1QQ720 
qb| AF091632 .11 AF091S32 
g h |A^102a ig.llAF102815 
crb ( AF1023 14 . 1 I AF1023 14 
gH | AFQ220S5 .ll 
rjHlAy022059.ll 
qHlAF0220Sa.ll 
qbfAF022057.ll 
qb|AF022054 .if 

qb I AFQ1SS T7 . 1 1 AF01SS37 - ^ , 

cb |tl698g3 - if PS17$9863 Python s ebae cytochrome b (cytb) gene, 
gb 1 1769844 . 1 1 t,TOS9844 Lichanura trivirgata cytochrome b (cyt 
erhl AF143X93 .11 AF143193 Epinephelus sp. cytochrome b (cytb) 

Amphiprion ocellaris isolate 8 cytoc

Kobus ellipsiprymnus defassa cytochr
Kobus ellipsiprymnus ellipsiprymnus c
Eulemur rubriventer cytochrome b (cy
Eulemur macaco macaco cytochrome b (
Eulemur fulvus albifrons cytochrome
Elminia longicauda cytochrome b gene
Hippopotamus amphibius complete mi



<ThlAF121 222.1fAF121222 
obf AF09S625 . If AF096S25 
qb | AF09*S24 . 1 f AF09SS24 
abl AF031052 .11 AF0ai052 
qbl AF0 91049 .it AF08104 9 
oblAF0ai04a . ll AF081048 
qb|AF-0a2Q g3 .ltAF032Qg3 
^nbf A^010 9<57,llKAAJ'109S7 



qb 
qb 


U7S504 


. llCCU7SS04 


qb 


O75505 




ab 


U7SS03 


. llAPti7«503 


qb|LT7S508 


. li AIU76508 



qb| AF0349«9 
qb| AFOS137* 


. 1 ! AF0349S9 
. 1 I AF05 1875 


qb| AF0B2007 


.i^afo^oot 


qtjl AF032004 


ll AF06200S 


abl AF032005 


ll AF03200S 


qbi AFQ32004 


;|Af 082004 


abl AF082003 


;|Apo<?^oo? 


qbf AF032002 


ll AF032002 


q^l AFOH2001 


HAF092001 


qfol AFOH2000, 


ilAFO^Opo 




Hafo?;?7? 


qfrfAFOSl^S, 


ilAfgaim 




ll AF081997 




ll AFO* i*7<5 












il*r^ii77i 


? hlAM(ll512. 


JLlAf?1L77i 



Chlamydera lauterbachii cytochrome b g
Chlamydera cerviniventris cytochrome b
Amblyornis subalaris cytochrome b gene
Archboldia papuensis cytochrome b gene
Amblyornis inornatus cytochrome b gene
Connochaetes taurinus cytochrome b g
Rhodeus ocellatus cytochrome b (cytb
Vireo plumbeus plumbeus specimen-vou
Vireo plumbeus plumbeus specimen-vou
Vireo plumbeus plumbeus specimen-vou
Vireo plumbeus plumbeus specimen-vou
Vireo plumbeus plumbeus specimen-vou
Vireo plumbeus plumbeus specimen-vou
Vireo plumbeus plumbeus specimen-vou
Vireo plumbeus plumbeus specimen-vou
Vireo plumbeus plumbeus specimen-vou
Vireo plumbeus plumbeus specimen-vou
Vireo plumbeus plumbeus specimen-vou
Vireo plumbeus plumbeus specimen-vou
Vireo plumbeus plumbeus specimen-vou
Vireo plumbeus plumbeus specimen-vou
Vireo plumbeus plumbeus specimen-vou
Vireo plumbeus plumbeus specimen-vou
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ob 1 AF081991 . 1 1 AF081991 Vir«o plumbeua plumbeua specimen - vou 
oblS73150 1IS7315Q cytochrome b {Spermophilua richardsonn- 
gb [ AF012235 ■ 1 1 AF01223S Cryptomya hoctentotua natalenais eye 
cbtUS35ao _llNCa535ao Nycticebua coucang cytochrome b (eye b 
gb | US3S77 . 1 1 EFU53S77 Eulemur Culvua rufua cytochrome b (cyt 
cb I U5357g . 1 1 EFU53576 Eulemur fulvua collaria cytochrome b ( 
gb | U955 12 - 1 1 ESERCYTB2 Epteaicua serotinus 3' cytochrome b ( 
gh|U95509 . 1 1 9KUKLCrr32 Pipiacrellua kuhli S 1 cytochrome b ( 
gb |ui786a .HBTTjg78g8 Budorcaa taxicolor caxicolor cytochrom 
ghltri73S7.l |BTU178S7 Budorcaa caxicolor bedfordi cytochrome 
qh |ai78gQ *i|QOU178gQ Ovia dalli cytochrome b gene, mitochon 
gbtU1785^ - 1 [OCU17859 Gvia canadensis cytochrome b gene, mit 
^mh lAJ-OlO^gg'liASPQlOSSa Acomy3 spinas isaimus mitochondrial 
gh I AF034736 ■ 1 i AF034736 Capra falconer i cytochrome b (cytb) 
chi AF03471Q . 1 1 AFQ34730 Ovis aries cytochrome b (cytb) gene, 
Qh i AF034729 . 1 1 AF034729 Ovis vignei cytochrome b (cytb) gene 
gb AF0 3 472 8 . 1 L Ovis~dalli dalli cytochrome b (cytb) gene, m 
rrhl AFni4727.ll Ovis ammon darvini cytochrome b (cytb) gene, 
ah 1 AF034724 " 1 1 AFQ34724 pantholops hodgscni cytochrome b (cy 
gb 1 AF034722 ' 1 1 AF034722 Addax nasomaculatus cytochrome b (cy 
obIU72033 _l|MMU7203S Monodcn monoceros cytochrome b (cytib) , 
gh (072037. if DLU72Q37 Delphinapcerus leucas cytochrome b {cy 
cb I M99455 " 1 1 MtTOTTCYTB Murexia longicaudata cytochrome b gen 
gb 1 1,29055 1 1 SKPMTDLOOP Sheep mitochondrial cytochrome b (Cy 
Coccyzus americanus cytochrome b gen

Antechinus minimus cytochrome b gene
Antechinus swainsonii cytochrome b g
Anas undulata cytochrome b gene, par
Anas superciliosa rogersi cytochrome
Anas sparsa cytochrome b gene, parti
Anas melleri cytochrome b gene, part
Anas laysanensis cytochrome b gene,
Anairetes flavirostris cytochrome b
Mitrospingus cassinii cytochrome b (
Dacnis cayana cytochrome b (cytb) ge
Oryx leucoryx cytochrome b gene, mit

t . 1 / AHU0757g Antechinua haJbbema mitochondrion cytoc 

gK j AF028iao . 1 1 AFtnaTao Urocyon cinereoargenceua cytochrome 
ghl AF028i7a .11 AFQ28178 Paeudalopex sechurae cytochrome b (c 
gh 1 AF028170 . 1 1 AFQ 2 8170 Vulpea zerda cytochrome b (cytb) gen 
gb I M99454 . 1 1 ASWMTSCYT3 Antechinus stuartii cytochrome b gen 
gh I M99451 - 1 1 ASWWTCYTB Ancechinus awainaonii cytochrome b ge 
crbla234gi' ll AJW234gl Ancechinua naao cytochrome b gene, tnxt 
ab(U87l3fl " HTVTJ87138 Trichoaurua vulpecula cytochrome b (cy 
gb I U0759Q . 1 1 PMU07S90 Piaaigale macuiata mitochondrion cytoc 
^mto |AJ-004326 .1IPTaJ432S Phylloacopua trochilua mitochondria 
«blAFQ2Q255 . 11AFQ2Q255 Cyclura nubila cytochrome b (cytb) g 
~mfa f Y19184 . 1 1 LPA19I84 Lama pacoa complete mitochondrial genome 
n h |tjaaa62.1f Amphiiophua citrinellum cytochrome b (cytb) ge 
<-*Htaaa859.11 Thorichthya aureum cycochromc b (cytb) gene, m 
gh traaasa 7lT Thorichthya cr\ aureum cytochrome b (cytb) gen 
^bluaaaST.lt Kerichthya labridena cytochrome b (cytb) gene. 
q H |tJBaaS6.ll Kerichthya carpintia cytochrome b (cytb) gene, 
-mfal Y1QS24 . 1 1 WIMRCEN Macropua robuatua complete m^tochondri 
e r taiaai3 57 . 1 1C5U313S7 Chelydra aerpentina cytochrome b gene. 
^(aaiJS*. llOCOTUSS Chelodina longicoilia cytochrome b gen 
qblg75 3 54 . 1 1 LUU75 3S4 Lepcomyrmex unicolor cytochrome b gene 
gbt Q773 3 2 . 1 IGCU773 32 Gymnorhinua cyanoc-phala cytochrome -b 
embl Z2957 3 . I I OVMTGNME Oldelphis virginiana complete mitocho 
^mtol AJ222679 . 1 1 BTCYTOB aoaelaphua tragocamelua mitochondria 
#>mfc> I A,J2226ao . 1 I TSCTTQS Tragelaphua spekii. mitochondrial cyt 
^|AJ222i5M .llOOCTfTOB Oryx dammah mitochondrial cytochrome 
r7 HlM994<gtf ■ 1 1 PMLMTCYTB Peramelea naauta <r/tochrome b gene, c 
n h|N?l^2 . ifTHNMTCYTB Thylaclnu* cyncoctspha lu« cytochrome b 
^[^^Q.llPAvm-CYTB 0«yuru« hallucatu- cytochrome b gen^ 
g ft LT2373 7 , 1 J PW2373 7 >a C adl«aea mlnoc cytochrome b gene, ml 
r, Hf 1/1320 3 . 1 1 P«UlS2qj P«radl«ae* rudolphl mitochondrion cyto 
^H/l«<H.lleAUlS2<H Cplm^chu- albertl#l mitochondrion c-/to 
a ^| ari20a.llDRU152qq Olphyllode* c^^ubllc* mitochondrion c 
,mbl Ajqqq4 24 , it 3 T^/4 2 4 Sor-iK tundc«rn«l« p^ctl-l mlcochondrl 
f mblAJ00q4 2 1 . Sof« tundc-n«l« partial micochondcl 



ah\ AF082na7.llA?Q82047 
<rb [ XFQ 3 3 3 a S - 1 1 AFQ 3828$ 
gh j Ar 038^34 . 1 I A503 8284 
ebt AF0S9n93 -ll AFQS9093 
gb)AFQ59ng2 .llA?0S9092 
qb I Ag059rtqT- . 1 I AFQ5909I 
gb I AFQ 5 9n a Q - 1 1 AFQ 5 90 80 
gb(AF059fl 7a .lfAF059078 
qblAF052 74Q .1IAF0S2240 
gbl AF0Qg^4Q.ll AF006240 
gb [ AF0Qg227 . 1 1 AF006227 
gb ) AF047447 . 1 I AF047447 
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gmb lAJQ0043a < llSIAJ43 8 Sorex isodon partial mitochondrial c. 
*mb|AJ000437,llSIAJ4 37 Sorex iscdcti partial micochondrial c. 
^1^000428 .11SAAJ429 Sorex arccicua partial micochondrial. 
-mK[ii70Q0427 , 1 1 SAAJ427 Sorex arcticus ssp. waritimenai3 par. 
*mbl AJ000426.1 1SAAJ42S Sorex aaper partial mitochondrial cy. 
<»mb I AJQ0Q425 . 1 1 SAAJ42S Sorex aaper partial mitochondrial cy. 

Sorex granarius partial mitochondria.
Sorex granarius partial mitochondria.
Sorex araneus partial mitochondrial .
Hippolais caligata ssp. caligata mi.
g mbi;wqQ4 '^.u»u^^^ Hippolaia caligata sap. rama mitoch. 
q b |tJl5719 .1IRSU1S718 Ramphocelua aanguinolencus cytochrome . 
gb | LIllOS . 1 1 CGYMTCrraO Cracogeomya gymnurus mitochondrial c. 
gh|tn467q . 1 1 POtJ34S79 Philander opossum cytochrome b light s. 
qh |Lil907.llCG-fMTCrr3r Cratogeomys golcxnaai rube 11 us mitoch. 
gh 1 L11906 ~ 1 1 CGVMTCY73S Cracogeomya merriami mitochondrial c, 
qh |Lll9Q2.llCCWrcyr3A Cratogeomys caacanopa castanops mito. 
IJn | X92524 . l \ St,CYT3~ S . longiroa tria cytochrome b gene (compl . 
Q b)U4S77X .1IAOJ4S771 Anthus campestris cytochrome b gene, n. 
db j [AflQ2l773 .1IAAQ2I773 Anguilla interioris mitochondrial c. 
\ aa nnfi953 . 1 1 A30Qg953 Carassius auratus langsdorfi mxtoca- 



*mb f AJ0QO418.llSCAJ4ia 
gmh 1 AJ-Q QQ417 . 1 1 SCAJ4 17 
emb 1 AJOQu* IS . 1 1 SAAJ41S 
embjAJafll2i3 .11HCAJ4793 
*mblAJ004 792.llHCAJ4792 



rfhi A3QQ6^^J .11 Afluua^3j a-- 

l 7714 92 . 1 1 MT?TacrT3 P.trochilus mitochondrial cytochrome. 

^ ' ' - . , C — r» mi *rt^-Hftrtrf?Tia. _ 



emp lZ7J4^z. ;|niyi,Ruua — ; . - . 

1 aamg219.llA3035239_ Ostecglossum ferreirai mitochondria. 

1 ^ - - - ^_*~ a K ran* ^rnmnl^t . 



r!lrt ABQ3^ZJ^. 1 » AaVJ j j a J z uaus^i-w^*^- - 

-mh ly<*2S3 2.llMMCTT32 M.monocsros cytochrome b gene (complete 

' ■ _ i ■ i m*/«U^m^«*i An ma*f> <* C*/ 



emb I X925 3Z . 1 J wnd pi.monocsiua ^ - - 

^mh I Y7 4 2gQ . 1 1 MrvcCYT3 v.oiivaceua mitochondrion gene for cy. 



emb I X7426Q . 1 iwxv^uj v.gu^sw «»^ww ™ ^ . 

^kI y^?.91 .l(MISLCTT3B s . longiroshis mitochondrion cyts gea. 



^mb |X742SS.l 



^mh |ya2304 .1 



,»mb |X32302 .1 



^mb|XSg294 .1 



^mb|y74252 .1 



^mhl X72005 .1 



-mb lX74259.1 



^mb fYQaai4.1 



^mh |X562g7 .1 



^mb|X74253 .1 



gmb 1X60941 



gmb f X74255 . 



,»mb |XS0340 . 1 



amb I X92S3 0 . 1 ) LJVCTTB L.aiOiroattia W v_ww.*~>- — - 3 = 

ablaQ92gS - llCAU092g5 Cocc/sua americanus mitochoadrion cyto . 
• ^ ♦ l . nn-ti n^»* * 1 (»ucj5oenv3 mi tochondr la! . 



dhi \ A3023906 



Hb j | AB023 90S 



HH-j | AA0239Q4 



^bi ) AB023903 



dbi 1 D83633 . 1 



Hhi I paag3g . i 



dbi I D93S35 . X 



^h» 7 1 099633 . I 



dbi ( 099612 . 1 



, <ftbi 1 099630 . 1 



dbi I 099629 . 1 



dbi ( Q99627 . 1 



tlbi 1094204 . 1 



rtbl ID94202 . 1 



ribi 1Q92993 ■ 1 



^7 |Q92992. i 



(fej |092989. i 



flfal |Q32193 . HgBLmTCB23 8uJ&««lua «mTr: - ^™ "'^ w ' " ~ 



1 ABO 0 4 Q 7 2 



M n i|Aiiqq4q 69.ltAflqQ4<l61 C*OC micocho ndr I* I 0NA Co 

dtol IO994 40, 1 Ano* d«p<*«« tcocn l« ^UochondrUl ONA C. 

^ht tOfil6T7 t LlO<^<'l7 8u |,*;a« ^r.ochondfUt 0NA Cor c 



MISLCYTBA S.longirostris mitochondrion cytb ge.
MIPVCYTB P.violaceus mitochondrion gene for cy.
MIPHCYTBG P.hispida mitochondrial cytochrome b.
MIPFCYTBG P.fasciata mitochondrial cytochrome .
MIOACYTB O.aries mitochondrion cytb gene for c.
MIMKCYTB M.keraudrenii mitochondrion gene for .
MILWCYTB L.weddelli mitochondrial gene for cyt.
MILLCYTB L.ludovicianus mitochondrion gene for.
MIHLCYTBC H.liberiensis mitochondrial cytochro.
MIHACYTB H.amphibius mitochondrial cytochrome .
MIGCCYTB G.camelopardalis mitochondrion cytb g.
MISFCYTB S.fastuosus mitochondrion gene for cy.
MISAC33 3 Epimachus albertisii mitochondrial ge.
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Database: nt 

Posted date: Mar 2, 2001 12:20 AM
Number of letters in database: 2,863,827,885
Number of sequences in database: 807,597

Lambda K H
1.37 0.711 1.31

Gapped
Lambda K H
1.37 0.711 1.31

Matrix: blastn matrix:1 -3
Gap Penalties: Existence: 5, Extension: 2
Number of Hits to DB: 19058
Number of Sequences: 807597
Number of extensions: 19063
Number of successful extensions: 7580
Number of sequences better than 10.0: 2441
length of query: 26
length of database: 2,863,827,885
effective HSP length: 17
effective length of query: 9
effective length of database: 2,850,098,736
effective search space: 25650888624
effective search space used: 25650888624
T: 0
A: 30
X1: 6 (11.9 bits)
X2: 15 (29.7 bits)
S1: 12 (24.3 bits)
S2: 16 (32.2 bits)
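The effective lengths and search spaces reported in the BLAST summaries follow from simple arithmetic on the query length, the database size and the effective HSP length. The sketch below reproduces that arithmetic; the function name and the formula (subtracting the effective HSP length from the query and from every database sequence) are assumptions chosen because they agree with the figures reported for the 25-letter query, not a statement of how the BLAST program is implemented.

```python
# Minimal sketch of the effective-length arithmetic in the BLAST summaries.
# The formula is an assumption that is consistent with the reported figures.

def effective_search_space(query_len, db_letters, db_sequences, hsp_len):
    """Return (effective query length, effective database length, search space)."""
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_sequences * hsp_len
    return eff_query, eff_db, eff_query * eff_db

DB_LETTERS = 2_863_827_885   # letters in the nt database (Mar 2, 2001)
DB_SEQUENCES = 807_597       # sequences in the nt database
HSP_LEN = 17                 # effective HSP length reported by BLAST

if __name__ == "__main__":
    for query_len in (25, 26):  # lengths of the two primer queries
        print(query_len, effective_search_space(query_len, DB_LETTERS, DB_SEQUENCES, HSP_LEN))
```

With a 17-letter effective HSP length, a 25-letter primer leaves only 8 effective query positions, which is why the search space is small relative to the database size.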
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Table 12. The other animals belonging to distantly related species analyzed by 
the primers to demonstrate their universal nature

SN   Name of the animal
1    Indian black buck no. 1
2    Indian black buck no. 2
3
4    Pig
5    dog
6    chimpanzee (chimss)
7    human (humsk)
8    Hamster
9    crocodile no. 1
10   crocodile no. 2
11   turtle no. 1
12   turtle no. 2
13   mouse
14   varanus
15   Naga-oaga snake
16   Indian elephant
17   hen
18   dugong
19   lizard
20   weaver bird no. 1
21   weaver bird no. 2
22   buffalo no. 1
23   buffalo no. 2


WO 02/077278 






PCT/IN01/00055 



CLAIMS 

1. Universal primers named 'mcb 398' and 'mcb 869', capable of amplifying a
fragment of the cytochrome b gene of any animal species in a polymerase chain
reaction (PCR) and revealing the identity of biological material of any animal of
unknown origin at the species and sub-species level, said primers having the sequences:

Primer name    Sequence (5'-3')

mcb 398        "TACCATGAGGACAAATATCATTCTG"
mcb 869        "CCTCCTAGTTTGTTAGGGATTGATCG"
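The basic properties of the two primer sequences can be checked with a few lines of code. The following is a minimal sketch; the helper names `reverse_complement` and `gc_fraction` are illustrative and are not part of the specification.

```python
# Minimal sketch: length, GC fraction and reverse complement of the two primers.
# Helper names are hypothetical; the sequences are those recited in claim 1.

PRIMERS = {
    "mcb 398": "TACCATGAGGACAAATATCATTCTG",
    "mcb 869": "CCTCCTAGTTTGTTAGGGATTGATCG",
}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]

def gc_fraction(seq):
    """Return the fraction of G and C bases in the sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)

if __name__ == "__main__":
    for name, seq in PRIMERS.items():
        print(name, len(seq), round(gc_fraction(seq), 3), reverse_complement(seq))
```

The reverse complement of 'mcb 869' is the form in which the reverse primer site appears when the gene fragment is written on the forward strand.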

2. Primers as claimed in claim 1 wherein the fragment of the mitochondrial cytochrome b
gene is capable of significantly discriminating amongst various evolutionary lineages
of different animal species.

3. Primers as claimed in claim 1 wherein the fragment of mitochondrial cytochrome b 
gene is flanked by the highly conserved sequences amongst a vast range of animal 
species. 

4. Primers as claimed in claim 1 wherein the fragment of the mitochondrial cytochrome b
gene is polymorphic inter-specifically but monomorphic at intra-species
sources.

5. Primers as claimed in claim 1 wherein, in the species Antilope cervicapra, the sequence
of the fragment mentioned under claim 1 is as follows:

Mitochondrial cytochrome b gene sequence (398-869 bp) of Antilope cervicapra:

"taccatgaggacaaatatctttttgaggagcaacagtcatcaccaatctcctttcagcaatcccatacatcggtacaaacctag 
tagaatgaatctgaggagggttctcagtagataaagcaacccttacccgatttttcgccttccactttatcctcccatttatcattgc 
agcccttaccatagtacacctactgtttctccacgaaacaggatccaacaaccccacaggaatctcatcagacgcagacaaa 
attccattccacccctactacactatcaaagatatcctaggagctctactattaattttaaccctcatgcttctagtcctattctcacc 

ggacctgcttggagacccagacaactatacaccagcaaacccacttaatacacccccacatatcaagcccgaatgatacttc
ctatttgcatacgcaatcctccgatcaattcctaacaaactaggagg" 
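As an illustration of how the primers relate to the Antilope cervicapra fragment, the sketch below compares each primer with the corresponding end of the recited sequence. Only the first 25 and last 26 bases of the sequence are copied here; the function names are hypothetical, and the amplicon length is derived on the assumption that 398 and 869 are inclusive coordinates.

```python
# Minimal sketch: compare the primers with the ends of the recited fragment.
# Only the two terminal segments of the sequence are reproduced here.

FORWARD_PRIMER = "TACCATGAGGACAAATATCATTCTG"    # mcb 398
REVERSE_PRIMER = "CCTCCTAGTTTGTTAGGGATTGATCG"   # mcb 869

FRAGMENT_START = "taccatgaggacaaatatctttttg"    # first 25 bases of the fragment
FRAGMENT_END = "cgatcaattcctaacaaactaggagg"     # last 26 bases of the fragment

_COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]

def mismatches(a, b):
    """Count positions at which two equal-length sequences differ (case-insensitive)."""
    if len(a) != len(b):
        raise ValueError("sequences must have equal length")
    return sum(x != y for x, y in zip(a.upper(), b.upper()))

# Fragment length if positions 398 and 869 are both included (an assumption).
EXPECTED_LENGTH = 869 - 398 + 1

if __name__ == "__main__":
    print("forward primer mismatches:", mismatches(FORWARD_PRIMER, FRAGMENT_START))
    print("reverse primer mismatches:",
          mismatches(reverse_complement(REVERSE_PRIMER), FRAGMENT_END))
    print("expected fragment length:", EXPECTED_LENGTH)
```

The few internal mismatches lie away from the 3' ends of both primers, which is in line with the statement that the 3' ends are highly conserved.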

6. A method for the identification of the animal from a biological sample, said method 
comprising the steps of: 

a) isolating and amplifying the DNA from the biological sample to be tested using the
primers as claimed in claim 1,

b) sequencing the amplified products, 

c) blasting the sequence resolved in step (b) against the mito database of the National
Centre for Biotechnology Information (NCBI) using the BLAST program and determining the
most likely family of the animal source of the biological sample,
d) blasting the sequence resolved in step (b) against the non-redundant (nr) database of
the National Centre for Biotechnology Information (NCBI) using the BLAST program and
determining the most likely genus, species or, more precisely, the sub-species of the
animal source of the biological sample,

e) identifying the most significant alignment of the sequence resolved with 
cytochrome b gene sequence of the animal identified in steps (c) and (d) 
respectively and selection of these animals as 'reference animals' for further 
studies, 

f) isolating, amplifying and sequencing the DNA from the reference
animals on both strands in triplicate using the primers as claimed in claim 1,

g) aligning the sequences obtained using the CLUSTAL program and identifying the
variable sites amongst the animals analyzed,

h) comparing the nucleotide sequences pair-wise to determine the variation among the 
animals resolved and identifying the nucleotide sequence to which the DNA 
sequence of the biological sample bears maximum similarity as the source animal 
of the biological sample. 
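Steps (g) and (h) can be sketched as follows for sequences that have already been aligned to equal length. The sequences and reference names below are toy placeholders invented for illustration, not data from the specification, and the function names are hypothetical.

```python
# Minimal sketch of steps (g) and (h): variable sites and pair-wise similarity.
# All sequences and names below are hypothetical toy data.

def variable_sites(aligned):
    """Return 0-based alignment columns at which the sequences are not all identical."""
    seqs = list(aligned.values())
    length = len(seqs[0])
    if any(len(s) != length for s in seqs):
        raise ValueError("aligned sequences must have equal length")
    return [i for i in range(length) if len({s[i] for s in seqs}) > 1]

def percent_identity(a, b):
    """Return the percentage of identical positions between two aligned sequences."""
    if len(a) != len(b):
        raise ValueError("aligned sequences must have equal length")
    return 100.0 * sum(x == y for x, y in zip(a, b)) / len(a)

def closest_reference(sample, references):
    """Return the name of the reference with maximum similarity to the sample."""
    return max(references, key=lambda name: percent_identity(sample, references[name]))

SAMPLE = "ACGTACGTAC"
REFERENCES = {
    "reference_A": "ACGTACGTAC",
    "reference_B": "ACGTTCGTAC",
    "reference_C": "ACCTTCGAAC",
}

if __name__ == "__main__":
    print(variable_sites({**REFERENCES, "sample": SAMPLE}))
    print(closest_reference(SAMPLE, REFERENCES))
```

In practice the alignment itself would come from the multiple-alignment program named in step (g); this sketch only covers the comparison that follows it.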

7. A method as claimed in claim 6 wherein the universal PCR protocol works universally 
with the DNA template of any unknown animal origin and the universal primers 
mentioned under column 4. 

8. A method as claimed in claim 6 wherein the amplification reactions are carried
out in a 20 µl reaction volume containing approximately 20 ng of template DNA, 100 µM
each of dNTPs, 1.25 pmole of each primer, 1.5 mM MgCl2, 0.5 unit of AmpliTaq Gold
(Perkin-Elmer-Cetus, USA) DNA polymerase and 1X PCR buffer (10 mM Tris-HCl,
pH 8.3, and 50 mM KCl). The amplification profile followed should be: an initial
denaturation at 95°C for 10 min, followed by 35 cycles each of denaturation at 95°C for
45 s, annealing at 51°C for 1 min, and extension at 72°C for 2 min. The extension step
of the 35th cycle should be held for 10 min.
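The reaction set-up and cycling profile of this claim can be expressed numerically. The sketch below derives the primer concentration and the total programmed cycling time; it assumes that the 10 min hold replaces the 2 min extension of the final cycle and it ignores instrument ramp times. All names are illustrative.

```python
# Minimal sketch: primer concentration and programmed cycling time for the
# PCR conditions of claim 8. Ramp times are ignored (assumption).

REACTION_VOLUME_UL = 20.0
PRIMER_PMOL = 1.25          # amount of each primer per reaction

def primer_concentration_nm(pmol, volume_ul):
    """Convert pmol in a volume given in microlitres to nanomolar (pmol/ul = uM)."""
    return pmol / volume_ul * 1000.0

INITIAL_DENATURATION_S = 10 * 60
DENATURATION_S = 45
ANNEALING_S = 60
EXTENSION_S = 2 * 60
FINAL_EXTENSION_S = 10 * 60  # assumed to replace the extension of the last cycle
CYCLES = 35

def total_cycling_time_s():
    """Return the total programmed time in seconds for the whole profile."""
    regular_cycle = DENATURATION_S + ANNEALING_S + EXTENSION_S
    last_cycle = DENATURATION_S + ANNEALING_S + FINAL_EXTENSION_S
    return INITIAL_DENATURATION_S + (CYCLES - 1) * regular_cycle + last_cycle

if __name__ == "__main__":
    print(primer_concentration_nm(PRIMER_PMOL, REACTION_VOLUME_UL), "nM of each primer")
    print(total_cycling_time_s() / 60.0, "min of programmed cycling")
```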

9. A method as claimed in claim 6 wherein the method enables identification of the species
of the analyzed material (i.e. the DNA isolated from confiscated animal remains of
unknown origin) using public databases such as GenBank, NCBI etc.

10. A method as claimed in claim 6 wherein the method is used for animal identification to
link the crime with the criminal beyond a reasonable doubt.

11. A method as claimed in claim 6 wherein the method is used to establish whether
biological materials such as skin, horns etc. confiscated from animal poachers are
those of an endangered species.

12. A method as claimed in claim 6 wherein the method is used for establishing the
identity of confiscated animal parts and products of endangered animal species for the
purpose of producing molecular evidence of animal hunting and related crime in a
court of law, so that human violation of wildlife resources can be controlled.

13. A method as claimed in claim 6 wherein the method is used to infer the
geographical location at which a wildlife crime was committed, based on the cytochrome b
gene haplotype of the poached animal identified using the universal primers invented.

14. A method as claimed in claim 6 wherein the method is used for animal identification to
detect the adulteration of animal meat in food products for the purpose of food
fortification, by the food fortification agencies.

15. A method as claimed in claim 6 wherein the method is used to provide a universal
technique for detecting the origin of blood or blood stains etc. collected from the
scene of a crime related to offences such as murder, rape etc., in order to establish the
origin of blood found at the scene of crime when it appears that criminals have
intentionally spread the blood of an animal at the scene of crime to confuse the crime
investigation agencies and forensic scientists with human blood.

16. A method as claimed in claim 6 wherein the method is used so that it can be converted
to (a) COMMERCIAL 'MOLECULAR KIT' and (b) 'DNA CHIPS' based
applications for wildlife identification in forensics.
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1 



Confiscated biological material of unknown animal origin (i.e. 'adil.flesh')

DNA

Mitochondrial cytochrome b gene (1140 bp), with primer sites 'mcb398' and 'mcb869'

PCR amplicons (472 bp); gel lanes M and 1, with the 500 bp marker and the 472 bp product indicated

Sequencing using 'ABI Prism 3700' DNA Analyzer

Revealed sequence (328 bp)

TGAATCTGAGGAGGCTTCTCAGTAGACAAAGCTACCCTGACAC
GATTCTTTGCCTTCCACTTCATCCTTCCATTTATCATCTCAGC
TCTAGCAGCAGTCCACCTCCTATTCCTTCACGAGACAGGATCT
AACAACCCCTCAGGAATAGTATCCGACTCAGACAAAATTCCAT
TCCACCCATACTACACAATCAAAGATATCCTGGGCCTTCTAGT
ACTAATCCTAGCACTCATACTACTCGTCCTATTCTCACCAGAC
CTGTTAGGAGACCCCGATAACTACATCCCTGCCAACCCTCTAA
ATACCCCTCCCCATATCAAGCCTGAAT

Figure 1 a
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Sequence of cytochrome b gene (328 bp) revealed from biological material 
of unknown origin i.e. 'adil.flesh' using primers 'mcb398' and 'mcb869' 

Homology search in 'nr' database using 'BLAST'

http://www.ncbi.nlm.nih.gov/BLAST/

Sequences producing significant alignments:                        Score (bits)   E Value

gb|AY005809.1| Panthera pardus cytochrome b gene, partial c...          603        e-170
gb|AF053054.1|AF053054 Panthera tigris sumatrae isolate Su1 ...         527        e-147
gb|AF053053.1|AF053053 Panthera tigris tigris isolate B7 mi...          527        e-147
gb|AF053050.1|AF053050 Panthera tigris corbetti isolate C2 ...          476        e-132
gb|AF053049.1|AF053049 Panthera tigris corbetti isolate C1 ...          476        e-132

Selection of reference animals based on above information
and further analysis using primers 'mcb398' and 'mcb869'

Multiple sequence alignments
using 'Autoassembler'
[alignment panel not legible in the source scan]

Figure 1 b 
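The BLAST hit summary shown in Figure 1b can be read into a small data structure and ranked by score. The sketch below is one way to do this under stated assumptions: the hit lines are typed in by hand from the figure, the parser `parse_hit` is hypothetical, and an E value written as "e-170" is interpreted as 1e-170.

```python
# Minimal sketch: parse and rank the BLAST hit summary of Figure 1b.
# The parser and its field names are hypothetical.

HIT_LINES = """\
gb|AY005809.1| Panthera pardus cytochrome b gene, partial c... 603 e-170
gb|AF053054.1|AF053054 Panthera tigris sumatrae isolate Su1 ... 527 e-147
gb|AF053053.1|AF053053 Panthera tigris tigris isolate B7 mi... 527 e-147
gb|AF053050.1|AF053050 Panthera tigris corbetti isolate C2 ... 476 e-132
gb|AF053049.1|AF053049 Panthera tigris corbetti isolate C1 ... 476 e-132
"""

def parse_hit(line):
    """Split one summary line into identifier, description, bit score and E value."""
    head, score, evalue = line.rsplit(None, 2)
    identifier, description = head.split(None, 1)
    if evalue.startswith("e"):
        evalue = "1" + evalue  # "e-170" is shorthand for 1e-170
    return {
        "id": identifier,
        "description": description,
        "score": int(score),
        "evalue": float(evalue),
    }

def ranked_hits(text):
    """Return the parsed hits sorted by decreasing bit score."""
    hits = [parse_hit(line) for line in text.splitlines() if line.strip()]
    return sorted(hits, key=lambda hit: hit["score"], reverse=True)

if __name__ == "__main__":
    best = ranked_hits(HIT_LINES)[0]
    print(best["id"], best["description"], best["score"], best["evalue"])
```

Ranking by bit score places the Panthera pardus entry first, which matches the order in which the figure lists the hits.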
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Figure 1-c 
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Figure 2 
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Figure 3 
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Figure 4 
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